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The main purpose of this paper is 
to examine the work on the structure 
of groups or teams. The emphasis 
will be on communication structure, 
an aspect of group behavior that has 
received much attention in the recent 
experimental and theoretical litera- 
ture. The term “structure” refers 
here to a relationship in a group, e.g., 
“communicates to.” 

The following questions initiated 
this survey: 

1. How can the interactions or 
communications of a group, its struc- 
tural characteristics, be measured? 

2. How are structural character- 
istics related to group performance? 

Several areas contribute answers. 
The areas include sociometry, the 
mathematical techniques growing out 
of sociometry, and the group network 
studies. The contribution of mathe- 
matical techniques in answering the 
first question will be considered here. 
Some answers to the second question 


1 Prepared under Contract N7 onr-37008, 
NR-154-079, between the American Institute 
for Research and the Office of Naval Research, 
Psychological Sciences Division, Personnel and 
Training Branch, as part of a research project 
on team training and performance. The 
authors wish to thank Ardie Lubin and R. 
Duncan Luce for their helpful reviews of the 
paper. 

2 Now at Walter Reed Army Institute of 
Research. 


will be considered in a subsequent 
paper. 


SOCIOMETRIC AND RELATED 
TECHNIQUES 


There has been a considerable de- 
velopment of techniques for the 
description and analysis of the rela- 
tionships between group members. 
The initial stimulus for this work 
came from the area of sociometry. 
Two major changes have, however, 
occurred in these techniques since 
their inception. 

1. Originally, they were concerned 
solely with the pattern or structure 
of likes and dislikes within a group. 
They soon developed, however, to in- 
clude any pattern of relation. 

2. Originally, they were primarily 
graphical. Now they include the use 
of mathematical techniques. 

Examples of the original approach 
can be found in the work of Moreno 
(1934) and Jennings (1950). The so- 
ciogram or graphic presentation of 
the relationships in a group will not 
be considered here. Moreover, since 
many of the indices developed out of 
earlier sociometric work are closely 
tied to the use of the sociogram, and 
also because they have been sum- 
marized in other surveys (Lindzey & 
Borgatta, 1954; Proctor & Loomis, 
1951), the emphasis here will be on 
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the mathematical approaches. The 
survey by Proctor and Loomis (1951) 
gives a full treatment of the work on 
indices and statistical analysis that 
stems directly from the sociometric 
tradition. 

Work on the techniques develop- 
ing out of sociometry can be cate- 
gorized into the following areas: (a) 
construction of indices for group and 
individual characteristics; (6) enu- 
meration of structures; (c) comparison 
of groups; (d) analysis of subgroups; 
(e) assignment of individuals to sub- 
groups; (f) other approaches: graph 
theory, logic of relations. Although 
in most cases the techniques were de- 
vised for the relationships of choice 
and rejection, they can be applied di- 
rectly to relationships such as “‘inter- 
acts with” or ‘‘communicates with.” 
One of the aims of this survey is to 
set forth the available techniques and 
to indicate their possible application. 
Most of the techniques discussed 
here have not been applied exten- 
sively. Their practical and theoreti- 
cal usefulness cannot, therefore, be 
definitively evaluated at this point. 

The first major step in the mathe- 
matical treatment of sociometric ma- 
terial was noting that it could be 
cast in the form of matrices, and the 
operations of matrix algebra applied 
to it. This step was made by Forsyth 
and Katz (1946). Group structure is 
summarized by a matrix with cells, 
a;;. The subscripts 7 and 7 identify 
the row and column, respectively, of 
the cell entry. If a given relationship 
exists between individuals i and j 
(e.g., 7 speaks to j) then a;; #0. If not, 
then a;;=0. The treatment of the 
diagonal, a,x, depends on the pur- 
poses of the investigator. In most 
cases, zeroes will be entered in the 
diagonal. 

The simplest type of matrix to 
summarize the relations within a 


MURRAY GLANZER AND ROBERT GLASER 


group is one that simply notes 
whether or not the given relationship 
exists between a pair of individuals. 
In this case a4;=1 or 0. For ex- 
ample, if in a four-man group it is 
found that a speaks to (or likes) c, b 
does the same to a, c, and d; ¢ to b 
and d; and d to a and b; the matrix 
would appear as in Table 1. 


TABLE 1 
MATRIX WITH BINARY ENTRIES 








Receiver 








a b c d 
~ 2 ns 0 1 0 
so »b 1 —_ 1 1 
5 ¢ 0 1 _— 1 
B d 1 1 0 _- 





In some cases, the investigator is 
dealing with a relationship that 
varies in strength or frequency. For 
example, it may be that b speaks fre- 
quently to a, infrequently to c, and 
frequently to d, etc. In this case, the 
matrix might appear as in Table 2. 
The positive cell entries have been 


TABLE 2 
MATRIX WITH WEIGHTED ENTRIES 





a b c d 

a — 0 1 0 

b 2 — 1 2 

c 0 1 — 2 

d 2 2 0 _— 
weighted according to frequency. 


Here a;;>0 if the relation exists be- 
tween 7 and 7}. 

In other cases, the investigator is 
dealing with a bipolar relationship 
that can be placed on a continuum 
(e.g., strongly rejects, rejects, does 
not respond to, chooses, strongly 
chooses). In such a case, a;;#0 if in- 
dividual 7 responds to j7. The sign of 
the entry may then be used to indi- 
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cate whether the individual accepts 
(positive) or rejects (negative) an- 
other individual. The absolute size of 
the entry is used to indicate strength 
of the response. 

Suppose, for example, that in a 
three-man group a rejects ¢ and 
chooses b; b strongly chooses ¢ and ¢ 
strongly rejects both a and b. The 
matrix would then appear as in Table 
3. 

TABLE 3 


MATRIX WITH WEIGHTED BIPOLAR ENTRIES 


a b c 
a — 1 —1 
b 0 — 2 
c —2 —2 _— 


Katz (1947) has indicated some of 
the possibilities opened up by the use 
of matrices. These include tech- 
niques for writing equations describ- 
ing over-all changes in group struc- 
ture. As will be seen below, matrices 
also permit many types of complex 
analysis by relatively simple opera- 
tions. 


CONSTRUCTION OF INDICES FOR 
GROUP AND INDIVIDUAL 
CHARACTERISTICS 

Sociometry has been prolific in the 
construction of indices. The mean- 
ing of these indices is usually fairly 
clear and their computation rela- 
tively simple. A typical index is one 
cited (Proctor & Loomis 1951) as a 
measure of group cohesion. The in- 
dex is the number of mutually chosen 
pairs divided by m!/(m—2)! 2!, the 
number of ways that a pair of indi- 
viduals can be drawn from a group of 
n individuals. Since indices of this 
type have been presented in detail by 
Proctor and Loomis (1951), only 
those not discussed elsewhere are con- 
sidered here. 
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Group Indices 


Several indices of group charac- 
teristics have been suggested for the 
extent to which the group is centered 
on a small number of individuals. 
These indices are based on the vari- 
ance of the column sums of the choice 
matrix. One of these is the index 
of concentration suggested by Katz 
(1954). 

Hohn (1953) has developed an in- 
dex based on the ratio between ob- 
tained variance of column sums (s) 
and the maximum possible variance 
for the special case of the matrix with 
weighted entries in which each indi- 
vidual ranks all other members of the 
group in order of preference. The 
index is called the hierarchy index 
with the following formula. 


12 


1 = — — 
n(n?—1)(n—2)? 


oa an 
(xs. 4 lt] 


Landau (1951a) earlier developed a 
similar hierarchy index for another 
special case, that of dominance rela- 
tions. 


Individual Indices 


Much sociometric work is con- 
cerned with specifying characteristics 
of the individual in the group. For 
example, the total number of choices 
received and made by an individual 
is used to indicate his “popularity” 
and his “outgoingness’’ respectively. 
If Table 1 were a matrix of com- 
munications, b would be the most 
productive of output and receive the 
same amount of input as the other 
members. 

The simplest technique in describ- 
ing the individual is to use the sum of 
the rows and columns as above. This 
technique, however, makes it difficult 
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to compare individuals in groups of 
different sizes (Criswell, 1950; Ed- 
wards, 1948). Various refinements 
have therefore been introduced; such 
as, weighting the sum for each indi- 
vidual by the maximum number it is 
possible for him to send or receive. 
Another difficulty with the use of a 
simple row or column index is that it 
does not take into account indirect as 
well as direct connections between 
individuals. There may be an im- 
portant difference between the indi- 
vidual who is chosen by three group 
members who are themselves chosen 
by many others and the individual 
who is chosen by three “‘isolates.”’ 


Status Index 


Katz (1953) has developed a status 
index that takes account of such in- 
direct links. He makes use of the fact 
that in a matrix with binary entries, 
as in Table 1, the powers of the 
matrix give the number of indirect 
connections to each member of the 
group. Thus, if the matrix in Table 1 
has zeroes placed in the diagonal and 
is squared, that is, multiplied by it- 
self, the matrix in Table 4 is ob- 
tained. The entries in the squared 
matrix indicate the number of two- 
link connections between each pair of 
group members. For example, there 
is one two-link connection between 
a and b (a—c—>b), none between a 
and c, and two between c and a (cb 
—a and c—d—a). Cubing the orig- 
inal matrix would give a matrix all 
of whose entries are positive, indicat- 
ing that every member of the group 


TABLE 4 
MatTRIx OF Two-LInK CONNECTIONS 


0 
1 
2 
1 


corte 
pero 
i 





MURRAY GLANZER AND ROBERT GLASER 


has one or more three-link connec- 
tions with every other group member. 
The status index, 7, for each indi- 
vidual may be computed as the total 
of all direct and indirect links to the 
individual. These may be obtained 
from the column sums of the original 
matrix, plus those of all the powers 
of the matrix. One might consider, 
therefore, the column sums of 


T=A+A2+A8+ ---+At+.--- 
=(I—A)“-—I [2] 


where A is the original sociometric 
matrix with zeroes in the diagonal 
and J is the identity matrix. 

Katz suggests, further, that in- 
direct links be weighted inversely to 
the number of links involved. To do 
this a constant ¢ is employed with 
O0<c<1. The following formula then 
gives the matrix of summed and 
weighted values: 


T=cA+c?A?+ eee +ckA*+ es * 
=(I—cA)!-I [3] 


Katz derives an equivalent formula 
for the computation of status that 
finds the solution through a set of 
linear equations rather than by find- 
ing the inverse of a matrix explicitly. 


(= I-A’) 1=s [4] 


where A’ is the transpose of A, and 
S is a column vector whose entries 
are the column sums of A. The for- 
mula yields a set of linear equations 
which may be solved for t, the sums of 
the columns of 7. Katz also presents 
a formula for the weighting of ¢ ac- 
cording to the number of possible 
choices in the group. 


Leontieff Matrices 


Hubbell, by working explicitly 
with the inverse matrix, develops 














GROUP STRUCTURE AND BEHAVIOR 


much more extensive information 
from an approach similar to Katz. 
Hubbell points out that matrices 
summarizing relationships within a 
group have been used by Leontieff in 
his input-output or interindustry 
models. He therefore transfers the 
Leontieff techniques to the socio- 
matrix. The techniques are aimed at 
tracing the long-run effect of each 
member upon the others. In order to 
do this, use is made, as. Katz does 
above, of the fact that (J— A)- gives 
the sum of all the powers of A. 

An example of the approach pre- 
sented by Hubbell is the following: 
Suppose the matrix in Table 1 sum- 
marizes the relationships in a group. 
If the positive entries in the matrix 
are all changed to .25, this is the same 
as multiplying the matrix! by Katz’ 
constant, c=.25. The inverse of 
I—cA is given in Table 5. The in- 
verse matrix can be taken to sum- 
marize the eventual effect of each 
member on any other. Thus, one 
unit of activity on the part of 0 will 
eventually give rise to .40 units by a. 
The row sums indicate the long-run 
influence of each member. In this 
case, the row sum for 0 is the largest 
and that for a is the smallest. The 
column sums, here equal, may be 
taken to signify the amount of con- 
straint or pressure put on each mem- 
ber in the long run. 

Up to this point, the work is the 
same as that outlined by Katz (1953) 
with one major difference. The com- 
plete matrix of long-run influence is 


3 “Sociomatrix”’ will be used here and below 
instead of the longer “sociometric matrix.” 

4 This weighting could be rationalized by 
assuming that each member of the group 
gives only half his attention to what goes on 
within the group and that he distributes his 
attention equally among the inputs from other 
group members. The column sums are all, 
therefore, taken to equal .50, and the values 
in the cells changed accordingly. 
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TABLE 5 
Matrix or (J—cA) 

1.05 .09 .28 .09 

.40 1.20 .40 .40 

.19 .38 1.14 .38 

.36 oan .17 $.a3 
obtained. (Other differences such as 


not subtracting the identity matrix 
from the inverse matrix are unim- 
portant.) The use of the inverse 
matrix is significant for two reasons. 
First, it becomes easier to change the 
analysis from row sums to column 
sums. Second, it becomes possible to 
estimate the effect of various distri- 
butions of input from the external en- 
vironment. Thus, if each member re- 
ceives one unit of external input 
which he relays to members of his 
group, the eventual effect of each 
member can be computed by post- 
multiplying (I—cA)— by a column 
vector of ones. The column vector 
obtained is (1.51, 2.40, 2.09, 1.97) in- 
dicating that b is most influential. 
The effect of differences in the 
amount of external input can be sim- 
ilarly computed by postmultiplying 
with the appropriate column vector 
containing a different distribution of 
inputs. 

It is also possible to assign differ- 
ential weights to the likelihood of in- 
fluence (or communication) travel- 
ling along a particular channel. For 
example, it may be that ¢ pays more 
attention to a than to b. In this 
case, values in cells ai3 and d23 might 
be .42 and .08, respectively, instead 
of .25. When the inverse of J minus 
this new matrix is computed, it is 
found that the effects of the change 
are to increase the long-run influence 
of a and d and decrease that of b 
and ¢. 


5 This simply obtains the row sums dis- 
cussed in the preceding paragraph. 











322 


This technique is therefore of spe- 
cial interest in analyzing groups with 
respect to the interrelation of the 
members and the possible effects 
of changes in demands, pressures, or 
environmental contacts. It may be 
possible to pick out cases in which 
the imposition upon group members 
of a given distribution of demands 
from the environment so overloads 
some members of the group as to 
cause a breakdown. Changes in the 
distribution of demands or the struc- 
ture of the group may obviate this 
overloading. 


ENUMERATION OF STRUCTURES 


Work has been done in counting 
the number of graphs or matrices 
that display specified characteristics 
such as a given set of row and column 
totals. The work on enumeration is 
important in obtaining chance dis- 
tributions of various types of struc- 
tures. The work cited in this section 
concerns only matrices with binary 
entries. 


Number of Matrices with Given Row 
and Column Sums 

Katz and Powell (1954) attack the 
following enumeration problem: 
Given a matrix with certain row and 
column totals, how many distinct 
matrices with the same set of row and 
column totals in the same order may 
be generated? 

Solutions have been worked out 
and tables constructed by Sukhatme 
(1938), and by David and Kendall 
(1951), for matrices with binary en- 
tries. In the case of the sociomatrix, 
a restriction is usually introduced. 
The entries on the diagonal are either 
all zeroes or all ones. Katz has de- 
veloped a formula that permits the 
use of the tables for these special 
cases. 

The number of matrices (7) with 
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zeroes in the diagonal cells, and a 
fixed set of column sums (s) and row 
sums (r) is given by 


n 
ns n=A4 TT ats)-«, ob [5] 
i=1 

The operator 6; reduces entry 7 of 
the vectors of row and column sums 
by one. A is the number of all 
matrices, unrestricted with respect 
to the diagonal, that can generate 
the given set of column and row sums. 

The formula for n(s, r) is easy to 
apply since powers of 6 have to be 
considered only up to m; where m; is 
the smaller of (s;, 7,). This is ex- 
panded and the individual terms are 
evaluated using the tables mentioned 
above to evaluate the various A’s ob- 
tained through the formula. 


Number of Distinct Structures 


A different enumeration problem 
has been attacked by Davis (1953). 
Given a set of 7 elements, how many 
distinct structures of relationship are 
possible between them? Two struc- 
tures are distinct if they are not per- 
mutations of each other. In terms of 
matrices, two matrices are distinct if 
one cannot be obtained from the 
other by a simultaneous permutation 
of rows and columns. Davis develops 
formulae not only for the counting of 
the number of distinct structures, but 
also for the counting of specific kinds 
of relation structure, e.g., reflexive, 
symmetric, asymmetric, etc.6 The 
bounds for the number of structures 
are given as follows: 

2 


2n°-log n/log2< number of distinct 


2 
structures<2” [6] 
Davis, in a later paper (1954), ap- 
6 Davis’ paper concerns m-adic as well as 
dyadic structures. Only the dyadic structures 


are considered here. See also Copi and Harary 
(1953). 
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plied his formulae for the counting of 
structures to the special case of 
dominance relations. 


Distribution of Subgroup 
Configurations 


Closely related to the enumeration 
of group structures is work on the 
distribution of various subgroup 
structures. A large number of papers 
deal with the expected number and 
the distributions of various configura- 
tions on the basis of chance. This 
work has been done for stars (wheels), 
chains, rings (circles), various types 
of cliques and isolates. Much of this 
work has been carried forward by 
Katz (1952a; Katz & Olkin, 1952; 
Katz & Powell, 1957). It has im- 
portance in evaluating the results of 
the analysis of configurations. For 
example, if a given group has three 
isolates, it is of interest to discover 
how often this could have arisen by 
chance. However, a_ psychological 
theory or rationale to dictate the 
choice of configurations for study and 
to indicate why there should be de- 
partures from chance ordering has 
not yet been developed. 

If and when theory and experi- 
mental investigation develop in which 
group structure is a dependent varia- 
ble, then the distribution of particu- 
lar configurations will have consider- 
able importance. At the present time, 
however, configurations and _ their 
distribution seem to have little the- 
oretical or practical significance. 


COMPARISON OF GROUPS 


The question of the similarity of 
two matrices is important for either 
the evaluation of the amount of 
change in a group or the departure of 
the actual pattern from an ideal or re- 
quired pattern. The techniques pre- 
sented thus far for measuring sim- 
ilarity are basically correlational. 
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They involve pairing of correspond- 
ing entries in two matrices and cor- 
relating the values found. 


Comparison of Matrices Using Cell 
Entries 


When a matrix has binary entries 
there are only four ordered pairs pos- 
sible for the corresponding cells: (0, 
0), (1,0), (0, 1), and (1, 1). Katz and 
Powell (1953) therefore use a four- 
fold table to summarize the agree- 
ments and disagreements of entries in 
two binary matrices. They then con- 
struct an index of conformity based 
upon the observed cell frequencies: 


r= - [n(n—1)n4p—nanz| [7] 


NAnp 

where n(n—1)=number of off-diag- 
onal cells, 24 =number of positive en- 
tries in matrix A, mg=number of 
positive entries in B, n;=number of 
zero entries in B and n4zg=number of 
cells positive in both A and B. The 
index ranges between land — (1,2/np) 
and equals 0 when A and B are inde- 
pendent. The f for the agreement of 
A with B and B with A is not neces- 
sarily the same. For situations in 
which it is not assumed that one of 
the matrices is antecedent, the geo- 
metric mean of the two possible in- 
dices may be computed. The index is 
called C, a coefficient of concord- 
ance, 


/ 


C= Vint = 





. n(n—1)nap—Nanp | 
. —__—____—_—_ [g 

V NANEN BNA 
It ranges between +1 and —1. 

Katz and Powell point out that the 
same approach can be used to com- 
pare an individual’s choices within 
matrices A and B (both describing, 


of course, the same group). A four- 
fold table is constructed on the basis 
of two corresponding rows of the ma- 
trices and the same indices computed. 
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Comparison of Matrices Using Row or 
Column Sums 


Another type of comparison of 
matrices can be made by focusing on 
the row of column sums. This is 
easily accomplished by computing a 
product-moment correlation coeffi- 
cient for the paired row sums or the 
paired column sums. 

Hohn (1953) has discussed in de- 
tail the comparison of sociomatrices 
for the special case in which each 
individual ranks all others and zeroes 
are inserted in the diagonal. For this 
case the product-moment correlation 
between column sums may be written 
as follows: 


> sisi’— 
‘ 
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inner product, divided by the product 
of their magnitudes. 


Dd ad 
Vf roy re 


When a and 0 are deviations from 
their respective means, this is the 
standard correlation coefficient. To 
make use of either, it is necessary to 





cos Oa8= [10] 





- assign values to the diagonal entries 


in the matrix. (It would probably be 
best to assign the highest possible 
positive cell value to all cells in the 
diagonal.) It is then possible to 
measure the agreement between the 





n?(n—1)? 


4 





n(n— 1)? 


Vemma 


where s; and s,’ are the column sums 
for the 7th column of two matrices. 

Similar correlations could be ob- 
tained for row sums or column sums 
in the case of any type of socio- 
matrix, i.e., matrices with any type 
of weighted entries. In these cases, 
however, the general formula for the 
product-moment correlation coeffi- 
cient should be used. 


Generalization of Correlational 


Approach 


Katz (1947) has outlined a general 
method for evaluating agreement be- 
tween the patterns of individual 
choices. The measure of agreement 
considered is based on the angle be- 
tween vectors of choices; such as 
(0, 0, 1, 0) the first row vector in 
Table 1 (when zero is inserted in the 
diagonal cell). 

The cosine of the angle between 
two vectors a and 8 is equal to their 





5  #(n—1)2\ 
)( 2s _ * ) 


[9] 








vector of responses generated by each 
individual in the group with that of 
every other member of the group. 


ANALYSIS OF SUBGROUPS 


A large group often is assumed to 
have distinguishable subgroups 
within it. In certain organizations 
these subgroups are officially desig- 
nated, e.g., the board of directors of 
a company. In other organizations, 
they are neither official nor immedi- 
ately obvious to the outsider, e.g., 
factions within a_ political party. 
There have been several attempts to 
move from the intuitive grouping of 
individuals to a system for the unique 
segregation of subgroups on the basis 
of clear-cut criteria. The general in- 
terest of such a procedure is evident. 
In the analysis of man-machine sys- 
tems, for example, it would be desir- 
able to have a straightforward tech- 
nique for the identification of natural 
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groupings (of men and machines). It 
is also of interest to the student of 
administrative organization, for ex- 
ample, to compare officially desig- 
nated subgroups with the actually 
functioning subgroups in an _ or- 
ganization. 


Matrix Manipulation and Reduction 
Method 


There are several techniques avail- 
able for the analysis of subgroups. 
The first one suggested was that of 
Forsyth and Katz (1946) for matrices 
with bipolar entries (a;;=0 or +1 or 
—1). It can also be applied to the 
simpler case with binary entries (a;; 
=Oor +1). 

The Forsyth-Katz technique in- 
volves simultaneously switching the 
rows and columns of the matrix so 
that positive entries cluster about the 
diagonal, i.e., their distance from the 
diagonal is minimized. If there are 
negative entries their distance from 
the diagonal is maximized. The tech- 
nique helps identify cliques (clus- 
ters), leaders (center of cluster), and 
rank of cliques. The steps may be 
summarized as follows: 

1, Place in Rows 1 and 2 (and Columns 1 
and 2) a pair of individuals who choose 
each other: 

2. Place in Row 3 (and Column 3) an in- 
dividual who has mutual choices with 
both or, failing that, is chosen by both of 
the preceding pair. 

3. Continue adding individuals in this 
fashion, using as criterion the require- 
ment that each new member is chosen by 
at least 50% of the people already in- 
cluded in the subgroup. 

4. Remove the subgroup when no more can 
be added and repeat the process with the 
reduced matrix. 

After all the subgroups have been 
selected in this fashion, Forsyth and 
Katz specify methods for arranging 
the entire group along the diagonal 
(so that subgroups that reject each 
other are most widely separated) and 
for placing individuals who have not 
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been included in any subgroup. The 
final manipulation is to arrange the 
members of each subgroup so that 
those receiving the greatest number 
of choices are at the center. The 
Forsyth-Katz technique requires ra- 
ther awkward manipulation of ma- 
trices. Katz later developed a 
punched card technique for this an- 
alysis (1950). Methods for expedit- 
ing the rearrangements can also be 
adapted from the scalogram board 
technique developed for Guttman’s 
scalogram analysis. 


Diagonal Maximization Method 


Beum and Brundage (1950) have 
developed a more systematic tech- 
nique for carrying out the rearrange- 
ment of a matrix with positive 
weighted entries. The technique is 
based on the following idea. 

“It can be shown that if weights 
are assigned to the rows of a socio- 
matrix ...and the average product 
of the elements in each column and 
the corresponding weights is maxi- 
mized for each column, the sum of the 
squares of the elements about the 
principal diagonal is minimized.”’ The 
technique as outlined below requires 
that the system of weighting indi- 
vidual choices or relationships gives 
the heaviest weight to the most im- 
portant choices. The steps are as fol- 
lows: 

1. Zeros are inserted in the diagonal cells 

and column sums are obtained. 

2. Weights of 1 to N are assigned to the 
rows of the matrix, beginning with 1 as 
the weight of the bottom row. 

3. All matrix entries are multiplied by their 
row weight and weighted column sums 
are obtained. 

4. The weighted column sums are divided 
by the unweighted column sums to ob- 
tain the average weight. 

5. The matrix is then rearranged in order 
of average weights so that the column 
with the largest average is moved to the 
extreme left and the corresponding row 
is moved to the top. The column with 
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the next largest average is placed second 
from the left, etc. 


The procedure is repeated until 
further iteration does not change the 
arrangement or results only in an 
alternating arrangement of the col- 
umns. The procedure is related to 
that used in obtaining a latent vector 
of the matrix. Its success depends on 
the absence of overlap in the sub- 
groups. 


Matrix Multiplication Method 

A technique designed for the an- 
alysis of subgroups in binary matrices 
has been presented by Festinger 
(1949) and developed extensively by 
Luce and Perry (1949). The tech- 
nique as first analyzed 
“‘cliques.’’ A clique was defined as a 
subgroup of three or more members 
all of whom were symmetrically re- 
lated. The clique includes all mem- 
bers who meet the requirement of 
symmetrical relationship. For ex- 
ample, an individual is a clique mem- 
ber if he communicates with every 
member of the group and every mem- 
ber of the group communicates with 
him. This stringent requirement is 
relaxed in a later development by 
Luce (1950). 

In this method, the matrix sum- 
marizing the group’s relationships is 
reduced to an S-matrix consisting of 
all entries which are symmetric about 
the diagonal. This means eliminating 
all but mutual choices from the orig- 
inal matrix. The new S-matrix is 
then squared and cubed. The diag- 
onal entries of S* give the number of 
mutual friends for each individual. 
The diagonal entries of S* indicate 
whether an individual is a clique 
member. The magnitude of the off- 
diagonal terms in both S* and 5S in- 
dicates the compactness of the entire 
group. 

The following theorems are pre- 
sented and proved by Luceand Perry. 


proposed 
’ 
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1. The magnitude of the cell entries a;; 
in A® indicate the number of distinct 
n-chains from 7 to j.7 An n-chain is a set 
of (n+1) interrelated (interconnected) 
members. Thus, a speaks to b who 
speaks to ¢ is a 2-chain. 

2. A group member ¢ has a main diagonal 
value of m in A? if, and only if, he has a 
symmetric relationship with m members. 

3. A member 7 is contained in a clique if, 
and only if, his main diagonal entry in S* 
is positive. 

4. If there are ¢ diagonal entries in S* each 
of which equals (¢—2) (t—1) and the re- 
maining diagonal entries are zero, then 
the ¢ individuals form a clique. 

Luce and Perry also present a gen- 
eral formula relating the magnitude 
of entries for member 7 in the di- 
agonal of S* to the number, overlap, 
and size of the cliques of which 7 is a 
member. The formula and its proof 
are, however, incorrect. They apply 
only to special conditions of overlap 
in cliques. 

On the basis of the theorems, the 
following procedure may be used to 
analyze a group: 

1. Find the row in S* with the smallest or 

one of the smallest diagonal entries. 

2. Choose those members whose entries for 
that row are the highest. The number 
chosen will depend on the size of the 
diagonal entry. (If the diagonal entry 
were 12=(5—2) (5—1) then the 4 high- 
est entries would be selected.) 

3. Examine the set selected in the S-matrix. 
If they all have positive entries in each 
other’s row, then they form a clique. 

4. Repeat the process with the next highest 
(or equal) diagonal entry until each of 
the clique members has been assigned to 
a clique. 

5. Check the final set of cliques to make 
sure that each clique member is included 
in all cliques to which he belongs. 

Luce and Perry also present theo- 
rems demonstrating that the proper- 
ties of S are not lost in the process of 
cubing it, and a method for the com- 
putation of redundant 3-chains. A 
redundant n-chain is one in which 
the same individual appears more 


7 This relationship is the basis for Katz’ 
status index. See Table 4. 
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than once. Thus, a redundant 2- 
chain would be a speaks to 6 who 
speaks to a. The work on redun- 
dancies, which is necessary for the 
identification and enumeration of 
chains and circles in the group, has 
been carried further by Katz (1952b) 
and Ross and Harary (1952). 


Generalized Matrix Multiplication 
Method 


In a subsequent article, Luce 
(1950) generalized the method by re- 
laxing the definition of clique. He de- 
fined an n-clique as a subgroup of 
individuals who are all » or fewer 
links (choices, communication links) 
from each other. This extends greatly 
the utility of the approach. 

The matrix manipulation for the 
analysis of n-cliques is closely related 
to the operations given above for the 
analysis of cliques. (They would now 
be called one-cliques.) Only the first 
few steps are different. 

1. Compute the 

A}, e+ Ar 
2. Add these together. 
3. Take the cells of the summed matrix, re- 
place every positive entry with a one, 
and replace all diagonal entries with a 
zero. 

. From this point on, proceed as in the 

original matrix multiplication method. 


matrix powers A, A?, 


The techniques developed by Luce 
have the advantage of requiring rela- 
tively simple procedures. They have 
one major disadvantage in that they 
severely limit the type of information 
they can handle. As developed, they 
can deal only with the presence or ab- 
sence of a relationship, i.e., binary en- 
tries. Indications of degree of rela- 
tionships (e.g., likes very much or 
communicates infrequently) cannot 
be handled. It is, of course, possible 
to reduce weighted entries to binary 
entries by dichotomizing them ac- 
cording to some criterion. 

An example of the successful ap- 
plication of the matrix multiplica- 
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tion method may be found in a study 
by Chabot (1950). In the study, 
groups in an industrial situation were 
analyzed to test hypotheses concern- 
ing the relation of group member- 
ship to production. 


Vector Analytic Methods 


It was noted earlier that the inner 
product of vectors of choices can be 
used to compute a correlation 
efficient. By taking each row vector 
that makes up the matrix of relation- 
ships, the matrix can be easily con- 
verted into a matrix of correlations. 
The entire body of factor analytic 
techniques can then be brought to 
bear upon the data. 

Bock and Husain (1950) have pre- 
sented another special type of factor 
analytic approach. They start with a 
choice matrix summarizing the sub- 
ject’s ranked preference for other 
group members. On the basis of the 
rank and whether or not the choices 
are mutual, weights are assigned to 
each cell entry. These weights are 
then analyzed by means of Holzing- 
er’s B-coefficient technique. 

Rather than converting the vectors 
to correlation coefficients, and then 
following standard factor analytic 
procedures, it is more appropriate to 
deal directly with the original set of 
vectors that compose the socioma- 
trix. The steps then would involve 
setting up a for the space 
spanned by the vectors, and then 
rotating the basis to give some satis- 
factory fit. 

Viewing the choice or communica- 
tion vectors as an arrangement of 
vectors in a space leads to certain 
other ideas. In this arrangement, the 
number of dimensions in the vector 
space is related to the homogeneity of 
the group. A completely homogene- 
ous group would have dimension 
equal to one. Everybody would 
choose everybody else, and would 


co- 


basis 








328 


have the choice vector (1, 1,1,---, 
1), if the diagonal entries are taken as 
equal to 1. All vectors would fall then 
on the equiangular line. The equi- 
angular line gives then a baseline for 
measuring the homogeneity of the 
group. If the group is completely 
centripetal, then each individual 
chooses or communicates only to 
himself. The vectors are then in a n- 
dimensional Euclidean vector space 
with the basis (1, 0, 0,---, 0), 
(0,1,0,--- ,0), etc. in which » =the 
number of individuals. It can be 
reasoned further that if the group 
consists of several subgroups, then 
this number will determine the 
dimensionality of the vector space. 
The number of dimensions required 
can be ascertained by solving for the 
number of nonzero latent roots of the 
matrix. 

The advantages of a factor or 
vector analytic approach are the fol- 
lowing: 

1. The position of an individual in relation 
to all the subgroups or cliques may be 
described. 

2. Use can be made of data in which differ- 


ent degrees of relationship are ex- 
pressed. 


It should be noted, however, that 
this type of approach involves a dif- 
ferent definition of clique or sub- 
group. Individuals are members of 
the same clique if their patterns of re- 
lationship to other individuals are 
similar. It may be, therefore, that 
the Luce-Perry type of analysis is 
best suited for certain types of rela- 
tionships (communicates with, influ- 
ences) whereas a vector analytic ap- 
proach is best suited for other rela- 
tionships (chooses, finds himself sim- 
ilar to). 

The practical implications of anal- 
ysis of a group into subgroups 
should be underlined. The analysis 
is necessary whenever it is desired to 
group together individuals who in- 
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teract (e.g., hand each other tools) 
most frequently. An example of this 
type of problem in an industrial 
setting may be found in the discus- 
sion by Chapanis, Garner, and Mor- 
gan (1949) on rearrangement of men 
and equipment in a shop. The appli- 
cation of the technique to other work 
situations, such as offices with com- 
munication problems, is obvious. 


ASSIGNMENT OF INDIVIDUALS TO 
SUBGROUPS 


The analysis of subgroups is pre- 
liminary to the planned location or 
relocation of individuals in sub- 
groups. It leads naturally, therefore, 
to the following general question. 
How can the individuals in a larger 
group be divided into subgroups on 
the basis of the relationships between 
the individuals? 

Hotelling (1954) raised the issue in 
terms of the best way to assign indi- 
viduals to a given number of teams, 
so that their sociometric choices de- 
termine their placement as much as 
possible. He set up the following pro- 
cedure for computing the total satis- 
faction of the group under various 
groupings. First the matrix of choices 
is examined and then an assignment 
matrix J is constructed. The assign- 
ment matrix has entries 1 or 0 indi- 
cating whether an individual 7 has 
been assigned to team j. For example, 
given the choice matrix in Table 1, 
with zeroes on the diagonal, it may be 
desired to separate the group into two 
subgroups of two men each. Two 
possible arrangements are indicated 
in the assignment matrices J; and J2 
in Table 6. In Jj, aand b are assigned 
to one subgroup and ¢ and d to the 
other. In Jo, a and d are placed to- 
gether and 6 and ¢ are placed to- 
gether. The good fortune of indi- 
viduals is defined in terms of J’AJ, 
the choice matrix postmultiplied by 
the assignment matrix and premulti- 
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TABLE 6 


ASSIGNMENT MATRICES AND 
SATISFACTION MATRICES 











x ¥Y x ¥ 
a 1 O a 1 O 
b 1 0 b 01 
a, 01 J2=, 0 1 
d 01 d 1 0 
wana * : wana ' 2 


i) 





plied by the transpose of the assign- 
ment matrix. 

The total satisfaction of the in- 
dividuals in each group is indicated 
by the sum of the values in the prin- 
cipal diagonal. Thus J2 with the en- 
tries on the diagonal summing to 
three would be considered a better as- 
signment than J; where the J’AJ di- 
agonal sum is two. This means that 
in J; three out of the eight positive 
choices fall within the assigned sub- 
groups. 

Katz has begun work on situations 
in which the sizes of the subgroups 
are permitted to vary (1952b). Work 
has also begun on the development of 
techniques to maximize the sum of 
entries on the diagonal of the satis- 
faction matrix (Katz, Olkin, & Powell, 
1952). This is a crucial problem be- 
cause the number of ways to partition 
n individuals into k subgroups with 
m members each is n!/(m!)k. This 
easily becomes quite large. For ex- 
ample there are 12!/(4!)3 or 34,650 
ways to partition 12 individuals into 
3 groups of 4 members each. This 
problem has similarities to the per- 
sonnel classification problems that 
can be solved by linear programming. 


OTHER APPROACHES: GRAPH THEORY 
Locic oF RELATIONS 
The concern with methods of de- 


scribing and analyzing groups has re- 
sulted in the attempt to bring a spe- 
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cial branch of mathematics, graph 
theory, to bear on the problems of 
group organization. A nonmathemat- 
ical presentation of graph theory was 
made in a monograph by Harary and 
Norman (1953). Most of the mono- 
graph is devoted to the discussion of 
definitions that have been employed 
in mathematical graph theory and 
the relation of the definitions to 
psychological concepts. For example, 
Lewin’s concept of boundary between 
two regions in a life space is trans- 
lated here as in Bavelas (1948, 1950) 
into a connection between two points. 

In addition, Harary and Norman 
consider a generalization of graph 
theory to adapt it to psychological 
problems. The generalization in- 
cludes the addition of such concepts 
as strength—the number of lines join- 
ing a pair of points. Strength, of 
course, has been discussed previously 
as weighted cell entries. Moreover, 
since the graph as defined in mathe- 
matical work centers on symmetric 
relationships, they also present the 
concept of the directed graph in 
which symmetry is not required. 

Although some studies have been 
done using the vocabulary of graph 
theory, its effect on psychological 
theory and experimentation has not 
been extensive to this time. Exam- 
ples of the use of the vocabulary may 
be found in the recent article by 
French on social power (1956) and in 
Weiss’ work on organizational struc- 
ture (1956). Out of the work in this 
area, however, other techniques have 
been developed for the analysis of 
subgroups. 

A point that has been stressed on 
the basis of graph theory considera- 
tions concerns the role of key or 
liaison positions in a group. These 
are positions that serve as links be- 


tween subgroups, i.e., positions 
whose elimination results in the 
group's failing into distinct sub- 
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groups. Ross and Harary (1955) have 
developed techniques for determining 
the liaison positions in a group. The 
steps in the analysis are the follow- 
ing: 

1. A symmetric matrix with binary entries 
is written describing the group’s struc- 
ture. Note that here again there is the 
restriction to symmetric matrices. 

2. A matrix of distances between positions 
is constructed. This can be computed by 
using powers of the original matrix. 
Zeros are entered in the diagonal and 
one is entered in each cell with a positive 
entry in the original structure matrix A. 
Successive powers of A are then taken. 
Whenever a cell entry first increases from 
zero, then the power of the matrix in 
which this change occurs is entered in 
the corresponding cell of the distance 
matrix. (See Luce and Perry's first 
theorem above.) For example, if a;; be- 
comes positive when the matrix is 
squared, a two is entered in the corre- 
sponding cell of the distance matrix. 
Successive powers of the matrix are 
taken until all cells are accounted for. 

3. The highest number in the distance 
matrix is found and the positions whose 
columns contain this maximum number 
are eliminated from consideration as 
liaison positions. (These are called 
peripheral points.) 

4. Positions whose entry in any row is the 
highest for that row are eliminated. 
(These are called relatively peripheral 
points.) 

5. Positions are liaison positions whose 
entries in any row satisfy the following 
requirements: 

(a) not zero, 
(>) not the maximum for that row, 
(c) unique for that row. 


These steps may still leave some 
positions unclassified. Ross and 
Harary present techniques for clas- 
sifying the remaining positions. The 
discovery of liaison positions can 
have two uses. One is to distinguish 
key personnel in an organization. 
The other is to afford a basis for anal- 
ysis of the group into subgroups or 
cliques. Harary and Ross (1957) 
have also presented another method 
for the analysis of cliques in binary 
matrices that covers the case of over- 
lapping subgroups. In order to do 





MURRAY GLANZER AND ROBERT GLASER 


this they introduce the operation of 
elementwise multiplication of ma- 
trices symbolized by X. Under this 
operation, each cell in one matrix is 
multiplied by the corresponding cell 
in the other matrix. The analysis is 
based on the use of individuals who 
are members of only one clique, “‘uni- 
cliqual”’ individuals. The steps in the 
analysis are the following: 


1. The symmetric matrix, S, is obtained by 
eliminating nonreciprocated cell entries. 

2. The matrix S*XS is constructed. The 
cell entries indicate whether a pair of in- 
dividuals belong to the same clique. 

3. All rows which consist entirely of zeros 
are eliminated. Their corresponding 
columns are eliminated. The reduced 
matrix is called M. 

4. The row sums, r, and the number of cell 
entries, m, greater than zero in each row 
are obtained for M. If, for a given row 

r = n(n—1) 
then the individual represented by that 
row is member of only one clique. All 
members of that clique can be isolated 
by selecting all other individuals with a 
positive entry in that row. 

5. If this clique does not include all mem- 
bers in M, this procedure is carried fur- 
ther. The group is divided into two sub- 
groups: those who belong only to the 
first clique, and the remainder. The pro- 
cedure described above is repeated on 
the remainder subgroup. 


After all unicliqual members are ac- 
counted for, further subgrouping and 
analysis is continued. These sub- 
groups are defined on the basis of 
members whose row sums are mini- 
mal in the last reduced matrix. 

In addition to the techniques and 
approaches discussed thus far, recent 
developments in the fields of mathe- 
matics and logic promise to be rele- 
vant. It may be that the study of 
groups requires consideration of more 
complicated relationships than the 
dyadic relationships considered above 
(e.g., a talks to 6 about what c said 
to d, or as in some of Tagiuri’s recent 
work (1952)—a expresses his atti- 
tude to 6, and predicts b’s attitude 
toward himself). Copilowish (1948), 
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Copi and Harary (1953), and Davis 
(1953) have considered such n-adic 
relationships. 


DISCUSSION 


One of the major outcomes of the 
work described above has been to 
present a set of techniques for sim- 
plifying and analyzing the complex 
data generated by group functioning. 
These techniques, moreover, involve 
the translation of the data into math- 
ematical form that permits the ap- 
plication of a wide range of powerful 
analytic techniques. In addition, 
these techniques are applicable not 
only to the original sociometric rela- 
tions (e.g., ‘‘like,’’ ‘‘chooses’’) but 
also to other relations (e.g., ‘‘com- 
municates to,”’ ‘hands materials to,”’ 
etc.) They therefore permit the anal- 
ysis of a much wider variety of or- 
ganizational relationships than hither- 
to studied. 

As these analytic techniques be- 
come better known and more widely 
applied they will help promote the 
construction of quantitative systems 


8’ Examples of the structural properties of 
formal military teams that may be submitted 
to this type of analysis are considered by 
Glaser (1958). 
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for group behavior. It should not be 
expected, however, that these or re- 
lated techniques will furnish easy 
solutions to the problems of group 
structure and functioning. In most 
cases they will probably be helpful 
only in clarifying the requirements 
for adequate descriptive or explana- 
tory systems. At best, these tech- 
niques will provide the variables to 
be incorporated in such theories. 
There remains the task of construct- 
ing theories concerning the behavior 
of groups. There are two types of 
theorizing possible with sociometric 
or structural data. One concerns the 
causes of particular patterns or in- 
dices. The other concerns the effects 
of these. An approach. to theories 
concerning the cause of patterns is in- 
dicated in some work by Landau 
(1951a, 1951b, 1953) on the special 
case of dominance relationships. 
Much more work of this type is 
needed before the usefulness of the 
various measures suggested in the 
sociometric literature can be assessed. 
The second type of theorizing con- 
cerns the effects of patterns. This 
type of theory initiated the group 
network studies. These studies will 
be evaluated in a subsequent paper. 
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The term “personality assessment”’ 
refers to any procedure aimed at de- 
scribing a person’s characteristic be- 
havior by categorizing him with re- 
spect to some communicable dimen- 
sion or dimensions.? Since the OSS 
assessment procedures, however, the 
term has tended to be pre-empted for 
the procedure where several different 
types of assessment techniques are 
applied to the subjects and the final 
assessments are made by the com- 
bined judgments of several assessors 
concerning the subjects’ predicted be- 
havior outside of the assessment situ- 
ation. These procedures are ‘‘mul- 
tiple’ in two senses: with respect to 
the techniques and with respect to 
the assessors. 

Our treatment in this paper will 
deal with the basic logic of this type 
of assessment, and the discussion 
will be illustrated by the best known 
multiple personality assessments, de- 
tails of which are outlined in Table 1.° 
Each one of these assessments has, 


1 The author expresses his thanks to the 
colleagues who have discussed various points 
in this paper with him, especially to James 
Lumsden; also to Saul B. Sells for his valuable 
comments. 

? This is the same procedure as “‘instantiat- 
ing a person object in a module or set of mod- 
ules,” a terminology which the writer has pre- 
ferred in another context (Sarbin, Taft, and 
Bailey, in press), but which is avoided here in 
the interests of communicability. 

* Insofar as the assessments use multiple 
techniques, the problems of inferring the pre- 
dictions and validating the tests are the same 
as those involved in other multi-variate pro- 
cedures. See, for example, the treatment of 
these problems in Thorndike (1949). Our 
emphasis here will be mainly on the problems 
that arise from the combination of multivari- 
ate procedures and multiple assessors. 


in its own way, constituted a mile- 
stone in the history of multiple per- 
sonality assessment. 

The researches into personality 
conducted at Harvard in the 30’s un- 
der the direction of Murray (1938) 
were the first to use the typical pro- 
cedures of personality assessment— 
diagnostic committee assessments of 
personality based on interviews and 
a varied battery of objective, projec- 
tive, and situational tests. However, 
unlike the later assessments, no out- 
side criterion was used in these Har- 
vard studies, and, therefore, no more 
than passing reference will be made 
to them. The same applies to the 
continuing series of studies of per- 
sonality carried out by Cattell and 
his students (Cattell, 1957) which 
started to employ external criteria 
only at an advanced stage of its prog- 
ress. The British War¥Officer Se- 
lection Boards (WOSB), which were 
inspired by the German officer multi- 
ple technique selection procedures 
(Farago & Gitler, 1941), pioneered 
the use of a quasi-natural social situa- 
tion, including the leaderless discus- 
sion, as a basis for judging the po- 
tential social skills of the candidate. 
They also produced the first valida- 
tion material on multiple assessment 
procedures as a means of selection. 
The British Civil Service Selection 
Boards (CISSB) continued this work, 
with more emphasis on the valida- 
tion of individual techniques as well 
as the technique as a whole. The 
OSS assessment highlighted the psy- 
chological problems inherent in as- 
sessment and won many supporters 
for the value of combining multiple 
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TABLE 1 


DETAILS OF MILESTONE ASSESSMENT PROGRAMS 
(Wide sample of techniques used: individual arid multiple interviews; observation of group 
activities and situational tests; objective, projective, and perform- 


ance tests; ‘‘made to measure”’ inventories) 











Assessment Date Assessees 

Harvard (Murray, 1938) | 1934-37 | Young men, mainly 
Harvard undergrad- 
uates (paid subjects) 

WOSB (Harris, 1949; | 1942-45 | British officer can- 

Morris, 1949) didates 

OSS 1944-45 | U. S. Intelligence 
and espionage agent 
candidates 

Michigan, VA (Kelly & | 1946-49 | Clinical psychology 

Fiske, 1951) graduate students 

California IPAR (Vari- | 1950-51 | Advanced graduate 

ous publish and un- students 

— reports, e.g., 

arron, 1954; Gough, 

1953) 

Chicago (Stern, Stein, & | 1952-54 | Students in theol- 

Bloom, 1956) ogy, education, and 
arts 

Menninger (Holt & Lu- | 1946-52 | Psychiatric training 

borsky, 1958) candidates 

















; Strategies 
> ¢ 
E ee | (in order of Assessors 
purpos importance) 
Personality Analytic Psychologists 
research 
Selection Analytic Army officers, psychia- 
Global trists, and psychologists 
Selection Analytic Psychologists, psychia- 
Global trists, and other social 
scientists 
Validation of | Empirical | Psychologists (clinical 
techniques Global and nonclinical) 


Personality Empirical | Psychologists 

research; val- | Analytic 

idation of Global 

techniques 

Validation of | Analytic Psychologists 
techniques Global 

Selection, vali- | Global Psychiatrists and psy- 
dation of tech- | Analytic chologists 


niques Empirical 














tests and observations by pooling the 
judgments of several assessors; the 
Michigan VA assessment program 
did much to upset that support while 
the Chicago and Menninger assess- 
ments reinstated some of it through 
their promising findings. The Cali- 
fornia Institute of Personality As- 
sessment and Research (IPAR) dif- 
fers from the other assessments in 
emphasizing research into personal- 
ity to a greater extent. 


THE ORIENTATION AND PURPOSE OF 
PERSONALITY ASSESSMENTS 

Three foci of assessment can be 
distinguished: human performance in 
some socially defined situation or 
situations (the criterion perform- 
ance); performance in defined as- 
sessment situations, i.e. tests (the as- 
sessment performance); and the link 
between these two performances (test 


validation). Different assessment 
programs have been oriented towards 
one or more of these aspects depend- 
ing on their primary purpose (see 
Table 1). The orientation towards 
criterion performance implies the pri- 
mary purpose of assessing candidates 
with respect to the criterion in order 
to select or reject them. The orienta- 
tion towards the assessment per- 
formance is concerned with the 
validation of the assessment tech- 
niques themselves, while the orienta- 
tion towards the link between per- 
formances is concerned with research 
on the functioning of personality. 
Selection was the original purpose 
for the WOSB, CISSB, and OSS 
assessments; in each case, the as- 
sessors were presented with the im- 
mediate problem of selecting from a 
given group of candidates those who 
would make the most adequate army, 
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TABLE 1 

Criterion 

analysis — of Main criterion } 

method — 

— “Committee No external criteria 
| 
. - P 

Personal knowl- | 1. Committee Supervisors’ reports 


edge and 2. Final | 


Review Board 





Intuitive and in- | Committee 


terview with ex 


Field reports by the 
| sors and by field command- | 
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( Continued) 





Some selected validities (uncorrected 
for selection of groups) 





| 1. CISSB committee 0.13-0.25. 2. Review Board 
| 0.23-0.41. (When corrected for selection the 
range of validities is 0.50-0.66.) 
“Over-all” ratings 0.08-0.53 
sessment group and criterion). 


asses- | 


(varying with as 
Rating of “Ef- 


perts ers on several molar traits | fective Intelligence,’” 0.33-0.53 
| | 
Personal knowl Individual and | Ratings by clinical teach- | ‘‘Over-all’’ rating and clinical competence, 0.37. 
edge pooled ratings | ers and supervisors on sev- | Miller Analogies and clinical competence, 0.35 
| eral aspects of clinical work | Strong Interest Key for Clinical Psychological 
and Research Competence, 0.35. Other validities 
lower 


Committee 
edge student's pri 


} 
Personal knowl- 
tential. 2. 


Committee job Committee 
analysis and in- 
terviews with 
teachers 


Teacher 
exam results 


Committee job Individual and | 


analysis an averaged rat- pooled) on specifix 
success and | ings of inter- genera! competence 
ers 


failure 


view 





civil service, and secret service of- 
ficers, respectively. After a consider- 
ation of the personality requirements 
of the positions for which they were 
selecting officers, the assessors judg- 
ing the candidates on the basis of 
techniques chosen either because 
they appeared to have face validity 
for measuring these requirements or 
because ‘the assessors were familiar 
with their use. At least in the case of 
wartime assessments, neither the 
time available nor the conditions 
permitted more scientific procedures 
than that, and it was hoped that 
accuracy would be achieved through 
weight of numbers (of techniques 
and of assessors). 

Test validation (and construction). 
Some of the later assessments, no- 
tably the VA study, set as their short- 
run aim the task of developing and 
validating techniques for future use 


1. Teacher's predic 
fessior 
Teacher's rat- 
ings of personal soundness 


‘s judgments and 


Supervisor's ratings 


tion of | 1. Cross-validated inventory with 1., 0.29 


1al po 2. Committee ratings with 2.,0.41 


Very high validities 


Interviews (global), 0.24. Interviews (analytic), 
| 0.26 ester's an alytic ratings on projectives, 
| 0.27. Objective scoring of projectives cross-vali 
| dz ited at zero. Best interviewer (all data) 0.57 


and 


in selection. The validation studies 
were applied not only to the individ- 
ual items and tests, but also to the 
purely subjective techniques, such as 
group observations and interviews. 


In some of them, e.g., Menninger, 
the individual judges were also 


validated as though their judgments 
were scores on a test. When we 
speak of the validity of assessment 
techniques it is important to include 
these judgments among the _ tech- 
niques, as they vary greatly in their 
accuracy. 

The Harvard studies were the first 
to use multiple assessment tech- 
niques for personality research, and 
the outstanding recent example is 
the IPAR work at California. (The 
large-scale factor analytic studies of 
personality, e.g., Cattell and Eysenck 
[1947], did not use the combined 
judgments of several assessors.) The 
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Harvard studies dealt with the cor- 
relations between different perform- 
ances that were elicited in the test 
situation, whereas the IPAR studies 
were concerned, in addition, with the 
relationship between the assessment 
performances and criterion measures 
such as ratings by university teachers 
of the subject’s professional poten- 
tial, his originality, and his personal 
soundness. The Michigan studies of 
clinical psychologists were similar in 
orientation. 

Most of the personality assess- 
ments have tried to pursue more than 
one of the above purposes at once, 
but there are drawbacks to such at- 
tempts at economy. For example, an 
attempt was made in the CISSB 
studies (Vernon, 1950) to combine 
selection and validation, but the 
validation indices were lowered and 
distorted by the attenuation of the 
sample through rejection of candi- 
dates. The low validities obtained 
became remarkably high (for that 
sort of prediction) when a correction 
for selection was applied, but such 
corrections are only arbitrary esti- 
mates. The use of assessment proce- 
dures for selection implies that the 
procedures have already been vali- 
dated, but this has usually not been 
the case. The assessors have either 
had to use whatever prior knowledge 
they possessed about the validity of 
the techniques for the purpose at 
hand, or they have had to base their 
predictions on the relevant postulates 
in their theory concerning the link 
between the assessment and the cri- 
terion behavior of the subjects. For 
example, the assessors presume that 
the situational tests in the assessment 
program have what Cronbach and 
Meehl have termed ‘‘content”’ valid- 
ity (1955). But in selection, this type 
of validity can be regarded only as a 
holding procedure for an ultimate 
“‘predictive” validity. Where the 
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criteria are imprecise and not re- 
peatable, or where selection is urgent, 
a separate validation study may not 
be practical, and under these circum- 
stances there is no alternative to con- 
ducting selection without prior vali- 
dation. It still may be possible, how- 
ever, Over a period of time, to utilize 
the imperfect validational material 
that becomes available in order to 
improve the existing selection proce- 
dures. This seems to have been the 
case, for example, in the OSS studies. 

Validation studies of the assess- 
ment techniques also logically pre- 
cede the use of those procedures for 
personality research, although tech- 
niques used in such research often 
are accepted on the basis of their 
face validity. To use the one and 
same study to validate the tech- 
niques and to use them to measure 
personality is lifting oneself up by 
one’s boot-straps. In fact, however, 
the assessments which attempt to 
carry out this dual purpose obtain 
independent support for the ‘‘boot- 
strap lift’’ from already existing in- 
formation regarding both validation 
and the functioning of personality. 
Even then, the interpretation of per- 
sonality research projects that do not 
commence with a pilot study on the 
validity of the instruments is always 
subject to doubt. How do you know 
that expressed hostility to authority 
figures on the TAT measures sup- 
pressed rebellious tendencies? How 
do you know, when assessor X ob- 
serves a subject to be dominant, that 
he is dominant? How do you know 
that observed “role empathy” in a 
role-playing test is a valid predictor 
of social skill? Such questions can be 
answered only by the progressive 
refinement of validity information 
and personality theory. 

Assessment procedures usually rely 
on many unvalidated tests, and when 
the correlations between the tests are 
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used as a means of studying per- 
sonality—as in the case, for example, 
of the Harvard and IPAR studies— 
it is necessary to decide whether 
these correlations are to be treated 
simply as validity indices, or whether 
the validity of the tests will be as- 
sumed and the correlations treated as 
throwing light on the relationships 
between different personality struc- 
tures. The problem of simultaneous 
validation of tests and the study of 
personality is related to the problem 
of “concurrent” and ‘construct’ 
validity. By setting up some of the 
behavioral measures made during the 
assessment as tentative criteria, it is 
possible to validate other assessment 
measures against these. Cronbach 
and Meehl call this concurrent vali- 
dation, and it is one way of utilizing 
previous knowledge of validities by 
choosing criteria measures that have 
reasonably well-established reliabili- 
ties and validities. Then on the basis 
of all that is known about these 
measures, their implications for the 
understanding of personality can be 
explored further by a strategy of 
construct validation. The data col- 
lected during the assessment can be 
added to the ‘‘nomological nets’’ al- 
ready used in thinking about the 
particular personality constructs and 
new hypotheses developed for inves- 
tigation in later studies. Thus, even 
an assessment program that is aimed 
primarily at the purpose of selection 
can make a contribution to personal- 
ity research through construct valida- 
tion. (The place of construct validity 
in an assessment program is discussed 
more specifically below under the 
heading of analytic strategy.) This 
concept also enables an assessment 
program to avoid the problem of the 
priority of validation of instruments 
(versus conducting personality re- 
search) by conceiving both validation 
and personality research as two as- 
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pects of the one endeavor, both as- 
pects gradually throwing light on 
each other as more and more data 
accumulate. 

But this double-aspect approach 
of construct validation is an uneco- 
nomical process. Refinements may 
often be made more readily to our 
personality theory or to our knowl- 
edge of the validity of the techniques 
by a more direct approach to one or 
the other. In this case the problem 
of priorities which we have discussed 
cannot be avoided. 


THE PREDICTION STRATEGIES 
IN ASSESSMENT 
The Criterion 

All assessment programs involve 
studies of the link between two or 
more pieces of behavior, whether the 
primary purpose be selection, valida- 
tion research on tests, or personality 
research. Some of this behavior is 
known as assessment behavior and 
some as criterion behavior. These 
concepts are analogous to the inde- 
pendent and dependent variables in 
experimental psychology, and it is an 
arbitrary decision by the experimen- 
ter which one is designated as which. 
Most of the reports of assessments 
have devoted some space to the cri- 
terion problem, especially the report 
of the Chicago assessments (Stern, 
Stein, & Bloom, 1956). Most of the 
problems are similar to those in- 
volved in the validation of mul- 
tivariate objective techniques dis- 
‘cussed, for example, by Thorndike 
(1949). 

A special problem that arises in 
personality assessment is the fre- 
quent unreliability of the criteria 
which so often represent subjective 
judgments that vary from one cri- 
terion rater to another. This un- 


reliability imposes a serious limita- 
tion on the potential validity of per- 
sonality assessments, and it makes it 
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difficult to evaluate some of the low 
validity coefficients reported. 

The designation of the criteria of 
performance is determined by the cir- 
cumstances of the assessment, and 
usually must be taken for granted by 
the assessors. Thus, in the Chicago 
study the assessors explicitly ac- 
cepted the principle that the criterion 
ratings represented the predilections 
of one or more supervisors with whom 
the subjects interacted in the cri- 
terion situation, and that the asses- 
sors’ predictions of the subjects’ suc- 
cess must be made in reference to the 
“‘psychological job requirements” im- 
plied by thes¢ predilections and in- 
teractions. The assessment strategy 
should be aimed at the criterion, once 
the latter has been established. Kelly 
(1957) did not accept this principle in 
his researches on medical school se- 
lection. In this study he analyzed 
the criterion measures and found 
that there were at least three, and 
possibly four, types of medical per- 
formance which could be predicted 
independently. In the long run, how- 
ever, a selection program has to 
choose between the independent cri- 
teria, or the criteria have to be com- 
bined by some type of simple, 
weighted, or complex, interactional 
summation, or by taking account of 
one critical instance. 

A complication that arises in cri- 
teria analysis, such as that of Kelly, 
is that an assessor can only predict 
to indices of the criteria, not to the 
actual criteria themselves. It may be 
possible in some instances for the as- 
sessor to demonstrate that an index 
used in assessment has a low correla- 
tion with some more satisfactory, al- 
though less accessible, criterion index; 
for example, that academic grades in 
medicine do not represent the doctor’s 
subsequent service to the community 
as a practitioner. Assuming that the 
latter is accepted as the more funda- 
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mental in medical practice, the as- 
sessors should predict to it rather 
than to academic grades by trying to 
obtain some accessible index which 
more realistically measures this cri- 
terion of community service. Some- 
times the assessors may be able to 
convince those who control the cri- 
terion ratings that the indices which 
the latter are using are not consistent 
with their fundamental criterion, but 
eventually the assessors and the cri- 
terion raters must agree on some 
criterion index in accordance with 
the policy of the organization. Other- 
wise it would be absurd to speak of 
the validity of the assessment. 

Three types of strategies can be 
distinguished for predicting the cri- 
teria performance: naive empircal, 
global, and analytic, and we shall now 
consider each strategy in detail. 

1. Naive empirical. This refers to 
the classical method of test construc- 
tion, adapted from aptitude testing, 
in which the inclusion in a selection 
program of a test—or test item, 
which may be treated for our pur- 
poses as a separate test—is deter- 
mined mainly by its predictive valid- 
ity, i.e., by the degree to which it cor- 
relates with or discriminates a speci- 
fied criterion. Tests that are not 
sufficiently valid are either dropped 
from the program or amended and no 
consideration is given to the meaning 
of the test behavior, except as an 
afterthought. The naive empirical 
strategy, thus, is one in which in- 
ference proceeds directly from test to 
criterion without the mediation of 
intervening variables. 

Not a great deal of use has been 
made of this empirical strategy in 
multiple personality assessment, 


partly because of intellectual resist- 
ance to atheoretical procedures on 
the part of personality researchers, 
and partly because of the absence of 
reliable criteria. The outstanding ex- 
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amples of the use of the naive em- 
pirical strategy are found in the 
IPAR studies, especially in the scales 
of the California Psychological In- 
ventory and the Adjective Check 
List compiled by Gough. These 
scales, which give unit weight rather 
than beta weights to the tests, i.e., 
items, enable predictions to be made 
to quite complex behavioral criteria, 
for example, tolerance, delinquency, 
academic achievement, neuroderma- 
titis, potential social status (Gough, 
unpublished bibliography, IPAR, 
1955). 

The naive empirical strategy has 
the advantage over other strategies 
of objectivity and also of enabling 
assessors to predict complex and little 
understood behavior. But it also 
has serious limitations: it can be used 
only where suitable criteria groups 
are available for validation and cross- 
validation, and the validities may 
“drift”? owing to changes in signif- 
icant aspects of the conditions—tem- 
poral, geographic, public attitudes 
and information, set of the subjects, 
etc. Either some understanding of 
the underlying theoretical factors is 
necessary to provide a warning sys- 
tem against ‘‘drift,’’ or constant re- 
validation must be carried out. 

The primary purpose served by the 
naive empirical approach is that of 
constructing, and validating assess- 
ment instruments, although the long- 
range purpose can be both selection 
and research on personality. Up to 
a point, the personality research aim 
can be served simultaneously with 
the validation aim, since the dis- 
covery of the intercorrelations be- 
tween the tests themselves and the 
criteria can suggest personality con- 
structs. But we are now back on the 
problem of priorities: we can use 
validation studies for personality re- 
search only if we already possess 
postulates about the significance for 
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personality of the behavior tapped by 
the tests and the criteria. 

In this reference we should briefly 
consider the sources of the test items 
that are used in the validation ‘“‘try- 
outs.”” The sources may be naive 
empirical, or they may be theoretical. 
Empirical sources include: tests in 
the general area that are traditionally 
used, those that are readily available 
and can conveniently be given, tests 
whose title or item content bear a 
superficial relationship to the cri- 
terion, and tests which have pre- 
viously been shown to relate to the 
criterion. Theoretical sources of 
tests, on the other hand, include the 
systematic or unsystematic. sampling 

usually the latter—-of the areas of 
personality that are considered by the 
researcher to be relevant to the cri- 
terion behavior. The empirical out- 
look of the student who is developing 
personality assessment techniques is 
seldom so naive that it is entirely 
uninformed by theoretical considera- 
tions, so that the ‘‘naive empirical” 
approach in practice tends to become 
mediated by intervening structures 
and thus to approach the analytic 
strategy described below. The in- 
tervening structures, however, are 
not made explicit in this empirical 
approach. 

2. Global. This is the second non- 
mediated strategy, in which the as- 
his intuition, em- 
pathy, and verstindnis processes to 
provide the predictions, rather than 
using statistically established asso- 
ciations between assessment behavior 
and criteria. If any analysis is made 
of the criterion in a global strategy it 
is directed at the social role expecta- 
tions for the criterion performance 
rather than at the required personal 
qualities for successful performance. 
(The latter is more appropriate to the 
analytic strategy discussed below.) 
Information may be given to the as- 


sessor relies on 
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sessors*about the subject’s perform- 
ance on objective tests, and even con- 
cerning the validity of these tests— 
for example in the Menninger studies 
(Holt, 1958)—but the ultimate as- 
sessment is a global one. This proce- 
dure is the personality assessor’s 
answer to some of the drawbacks of 
the empirical strategy. Intuitive pre- 
dictions can be used when the asses- 
sors have only a vague concept of the 
criterion conditions, but empirical 
methods require clear-cut criteria and 
expendable samples of trial subjects 
who have been rated on these criteria. 
Where this is impossible, as in the 
case of the OSS studies, the empirical 
strategy cannot be used, and intui- 
tion must be resorted to. 

The distinction between empirical 
and global strategies also is analogous 
to the distinction between narrow- 
and wide-band techniques (Cron- 
bach & Gleser, 1957), the former en- 
abling comparatively more reliable 
but limited predictions. Supporters 
of the global strategy have claimed 
for it special adaptability to the 
vagaries of the conditions associated 
with both the assessment and cri- 
terion situations. Some writers also 
claim for it a special virtue in con- 
nection with personality research in 


that it avoids the violation of a 
“whole” person inherent in trait 
psychology; however, it is very 


doubtful whether it is correct to use 
the word ‘‘research’’ to describe a 
mode of study which, if it were ap- 
plied in its pure form of global 
verstindnis, would by definition pre- 
clude communication of the assess- 
ments. 

The value of the claims of the 
global strategists to have improved 
on empirical validation as a basis for 
selection programs is limited. Sub- 
jective methods of making predic- 
tions have seldom been shown to be 
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superior to objective methods where 
these are available, excepting in the 
case of especially competent assessors 
(see below). The relative competence 
of the assessors in making predictions 
about the subjects is analogous to the 
relative validity of the tests, and 
both can be established by the same 
type of validation techniques. In 
this way the empirical and the global 
strategies are similar in orientation: 
the proviso that incompetent as- 
sessors should either be eliminated 
from the assessment panel or trained 
to eliminate errors is analogous to the 
dropping or amending of an invalid 
test in the empirical strategy. 

The “‘nonanalytic”’ techniques used 
in the global strategy are not neces- 
sarily nonmediated by personality 
constructs, even though these con- 
structs may not be made explicit. 
The process of moving from observa- 
tions of behavior to inferences about 
future behavior uses a set of postu- 
lates about personality and various 
derived premises; these premises in- 
volve certain personality constructs 
or categories into which the assessor 
places the behavior of the subjects, 
i.e., he “‘instantiates’’ the behavior. 
Intuitive inferences, even empathic 
ones, can be reduced to this formula- 
tion which provides a bridge between 
analytic and nonanalytic processes. 
This point is elaborated by Sarbin, 
Taft, and Bailey (in press). 

3. Analytic. The analytic strategy 
makes explicit the role of mediating 
constructs in prediction. A_ two- 
stage inference is involved; first, 
there is an inference from the cri- 
terion requirement to the traits that 
are relevant to that performance (the 
“criterion analysis’); and, secondly, 
an inference from the subject’s ob- 
served behavior and test performance 
to his status on the trait dimensions 
(the assessment). Research on the 
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validity of these inferences requires 
two separate studies: one of the 
validity of the analysis of the cri- 
terion requirements and the criterion 
indices, and one of the validity of the 
tests as predictors of the criterion. 
These validation studies should be 
based on independent samples of be- 
havior and, for preference, on inde- 
pendent samples of subjects, the re- 
search on the criterion analysis to 
precede the validation of the in- 
struments. 

The importance of criterion anal- 
ysis was recognized in each one of 
the ‘“‘milestone’’ assessments, but the 
validity of the analysis is usually as- 
sumed. Two types of approach to 
the criterion analysis problem have 
been used: intuitive and empirical. 
The intuitive approach is the one 
usually used in personality assess- 
ment; typically the assessors have 
used either the testimony of ‘‘ex- 
perts”’ or their own theoretical anal- 
ysis to determine the criterion re- 


quirements. These analyses rest on a 
theory of personality, but the theory 
is usually not make explicit, nor is it 
subjected to empirical validation. 
The empirical approach to criterion 
analysis can employ subjective or ob- 


jective .methods. The Menninger 
studies, for example, employed sub- 
jective rating methods to compare 
the characteristics of successful and 
unsuccessful psychiatrists.‘ The VA 
assessment program was, among 
other things, one big empirical cri- 
terion analysis using both subjective 
and objective methods. The study 
began with no explicit analysis of the 


* Knowledge of the results of this analysis 
did not improve the validity of the assessor’s 
predictions, but this could have been caused 
by the assessors preferring to use a global 
rather than analytic strategy despite the 
analytic information which was supplied 


(Holt, 1958). 


341 


requirements in clinical psychology 
and ended with an explicit descrip- 
tion of some of the characteristics 
which relate to success in various as- 
pects of that profession. In a sense, 
all preliminary validation try-outs of 
tests in a naive empirical strategy, 
such as those used in the VA and 
IPAR programs, constitute a cri- 
terion analysis. The cross-validation 
that follows may thus be regarded 
as testing a series of hypotheses about 
the criterion behavior. Referring 
once more to Cronbach and Meehl’s 
contribution (1955), we see now that 
the analytic strategy is a type of con- 
struct validation which attempts to 
augment the ‘“‘nomological net’’ sur- 
rounding the relevant constructs. 
The main difficulty with the ana- 
lytic method of assessment is that it 
requires a set of constructs which 
may not exist in our present state of 
psychological knowledge—although 
the assessment results may contrib- 
ute to the development of such con- 
structs. The difficulties which factor 
analysts often encounter in their at- 
tempts to label their factors leads one 
to sympathize with Cattell’s pref- 
erence for using reference letters and 
numbers rather than trying to find 
meaningful labels for his personality 
factors (Cattell, 1957). The analytic 
method, then, is limited by the cur- 
rent state of development of per- 
sonality theory. A further drawback 
of a thoroughgoing analytic method 
of assessment is the practical con- 
sideration of economy of effort; the 
returns may be just as great, prob- 
ably greater, in the first pilot assess- 
ments in a program, if we use an 
empirical or global strategy without 
trying to make explicit the underly- 
ing theoretical relationships. In addi- 
tion, analytic assessments require a 
double inference and consequently 
the possibilities of error are increased ; 
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either the criterion analysis or the 
ratings of the candidate might be in 
errors. In the analytic strategy, how- 
ever, there is at least the hope that 
the sources of these errors will be 
discovered and corrected, whereas 
the sources are masked in the non- 
mediated strategies. 

The analytic strategy is applicable 
to any of the three purposes, selec- 
tion, validation research or personal- 
ity research, but its greatest poten- 
tial is in the latter; in fact, if the re- 
sults of assessment are to be of any 
value in increasing our understand- 
ing of personality, it is essential that 
the data be expressed in terms of 
basic personality constructs under- 
lying the subject’s behavior so that 
the scores and observations on the 
subjects may become meaningful. 
This applies both to naive empirical 
strategies such as factor analysis or 
blind item validation, and to global 
strategies in which the mediating 
constructs are not made explicit. 

To sum up: we have argued that 
both the naive empirical and the 
global strategies are actually medi- 
ated by analytic personality con- 
structs, but that it is not always 
necessary, or even possible to make 
those mediating variables explicit. 
This may apply both when the pur- 
pose of the assessment is validation 
of the techniques or the carrying out 
of an actual selection. But when the 
purpose is personality research, some 
explicit handling of the constructs is 
advisable. The concept of construct 
validation supports this requirement 
by merging the validation and the 
personality research orientations. 

Each of the three strategies has its 
particular uses in assessment pro- 
grams. Where mass screening is re- 
quired, the empirical strategy is usu- 
ally best, if possible; where the cri- 
terion situation is complex and un- 
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repeatable, but familiar to the asses- 
sors, the global approach is to be 
preferred, and where the relevant 
personality theory has attained a suf- 
ficient level of development, the 
analytic strategy is indicated. Where 
none of the basic requirements are 
present—a repeatable and reliable 
criterion, familiarity of the criterion 
to the assessors, or appropriate per- 
sonality theory—the assessors have 
to choose the strategy that seems 
best, although no strategy can really 
redeem such a hopeless situation. In 
general, personality assessors being 
what they are, they will prefer a 
largely intuitive approach, either 
analytic or global, as they did in the 
WOSB and OSS situations, but an 
increasing respect seems to be paid 
to the need for illuminating these in- 
tuitive methods by empirical anal- 
ysis wherever possible. 


SoME SPECIFIC ISSUES IN ASSESSMENT 
AS A METHOD OF PREDICTING 


Clinical Versus Statistical Approaches 


We have argued that there are oc- 
casions when intuitive methods of 
making predictions, i.e., ‘clinical’ 
have their appropriate place. Statis- 
tical methods cannot be used where 
no prediction formula exists. But 
some personality assessors speak as 
if the clinical method is always to be 
preferred as it enables the assessor to 
be flexible in his use of the data in a 
way that is not possible with statis- 
tical techniques; for example, the 
clinician can give weight to obvious 
but rare and nonrepeatable factors in 
the subject’s current situation which 
could not be validated empirically. 
Other advantages claimed for the 
clinical against the statistical ap- 
proach are that it does not violate 
the essential unity of the subject’s 
personality, and that it enables the 
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use of empathy and recipathy in 
making the predictions. (Actually 
these subjective clues could also be 
used as data by the statistician along 
with other more objective data.) 

Other assessors regard clinical tech- 
niques as only a last resort. A num- 
ber of advantages can be quoted for 
statistical prediction over clinical, 
most of which boil down to the fact 
that the statistician has a far more 
efficient memory and a larger atten- 
tion span than the clinician; he can 
‘“‘remember” the relevant data at the 
appropriate time and combine them 
with other data in order to obtain 
optimal weightings for future pre- 
dications. 

And so we have, on the one hand, 
the efficient but rigid and inhuman 
statistical prediction, and on the 
other, the flexible and humane but 
inefficient clinical. Which one is more 
useful in personality 
There are several discussions of this 


assessment? 


question available (e.g., Cronbach, 


1956; Meehl, 1954; Holt, 1958; 
Sarbin, Taft, and Bailey, in press; 
McArthur, 1954) so the points will 
not be elaborated fully here except- 
ing in so far as they directly affect 
multiple personality assessment pro- 
cedures. 

The weight of the evidence clearly 
supports the accuracy of the statis- 
tical approach compared with the 
clinical. Meehl’s notorious 
board (1954) recording the relative 
validity of clinical versus statistical 
prediction mounts grim evidence in 
favor of the latter. Holt (1958) crit- 
icizes Meehl’s summary on_ the 
grounds that most of the studies 
pitted sophisticated, actuarial pre- 
dictions against ‘‘naive clinical,” 
while others (e.g., Wittman’s) ac- 
tually showed the superiority * of 
“sophistical clinical’’ over naive clin- 
ical methods. But Meehl is quite 


score- 
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clear about the rules of his contest: 
the rival methods start off with ap- 
proximately the same objective and 
subjective data, although in some of 
the studies the clinician used addi- 
tional subjective data. The impor- 
tant difference is that the reported 
statistical predictions were based on 
the naive empirical method of valida- 
tion, while the clinical were either 
global or intuitively analytic. The 
statistical approaches were not con- 
cerned with the meaning of the cor- 
relations between the data and the 
criteria, although the use of cross- 
validation and statistical refinement 
meant that the empirical procedures 
were not as naive as it appeared, nor 
were they always uninformed by in- 
tervening personality constructs. 
Holt pleads for the use of “‘sophis- 
ticated clinical methods,” by which 
he means something similar to our 
analytic procedure, using intuition to 
make the final predictions. Among 
other things, he wants the clinicians 
to make preliminary studies of the 
criterion behavior, in order to analyze 
the requirements for success. Holt 
does not take the step of requiring 
validation of the individual clini- 
cians; but this is necessary to match 
fully the two sides in the contest. He 
reports that the best judges, using 
global clinical techniques, reached 
prediction validities of up to 0.57, 
whereas statistical treatment of the 
regular Rorschach _ scoring 
(validated and cross-validated) and 
the Strong Interest Psychiatrist key 
resulted in virtually zero validities. 
But Holt’s contest is unfair to the 
statistical side. His experiment was 
a half-hearted affair; no attempt was 
made to develop objective tests that 
would be appropriate to the selection 
problem at hand, as was done in the 
VA and the IPAR studies, and on 
Holt’s own admission the Strong key 


tests 
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was validated a long time previously 
in an entirely different situation. 
Holt’s report, as he himself indicates, 
does not provide us with a fair con- 
test between sophisticated clinical 
and sophistical _ statistical ap- 
proaches. 

In recommending a sophisticated 
clinical approach, Holt argues that 
“there simply is no substitute for 
empirical study oi the actual asso- 
ciation between a type of predictive 
data and the criterion’’ (1958, p. 3). 
Despite this, the evidence that he 
presents on the value of objective 
criterion analysis for the assessor 
(i.e., validation) is not promising. 
The assessors at Menninger were 
provided with ‘“‘manuals’’ embodying 
validation material on the interview, 
TAT, Rorschach, and other assess- 
ment techniques that had been used 
in an earlier assessment of psychia- 
trists at Menninger. Holt’s conclu- 
sion about their value reads as fol- 
lows: “Of the six, two proved worth- 
less... ; the other four all showed 
more or less promise, but there was 
none that yielded consistently signifi- 
cant validities regardless of who used 
it’ (1958, p. 8, italics ours). Evi- 
dently, the assessors would not, or 
could not, use the validation data 
which were provided for them. 

We are thus reminded once more 
that validation includes validation of 
the specific assessors carrying out the 
specific assessment task. Nearly all 
reports of personality assessments 
offer evidence that the assessors dif- 
fer considerably in their predictive 
skill. These differences are made up 
of two types of variation; variation 
due to differences in general ability 
to judge people (Taft, 1955), and in- 
teraction effects between the assessor 
and the type of judgment called for 
(Crow & Hammond, 1957). The re- 
ports on assessments offer the hint 
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that the highest validities are 
achieved by assessors who have the 
most familiarity with the criterion 
situations and with the type of per- 
son who is successful in those situa- 
tions; for example, in the CISSB 
assessments, the Board of Review 
consisting of experienced civil service 
administrators made more accurate 
predictions than did the original 
CISSB selection committee. In the 
former, the most valid predictions 
were made by the chairmen who were 
also civil service administrators. 

Accurate assessments are most 
likely to occur where the assessor 
uses the in-group stereotypes which 
are also held by the criterion raters; 
they are able to “play their predic- 
tions by ear’’ without any need to 
make the double inference involved 
in analytic techniques. In support of 
this method of predicting we can 
quote the comparatively high validi- 
ties found for ratings of the “‘like- 
ableness’”’ of the candidates in the 
Michigan, Menninger, and IPAR 
assessments. For example, in the 
latter, the assessors were mainly uni- 
versity professors, and it is therefore 
not surprising that their ratings of 
“personal soundness”’ correlated as 
high as 0.52 with ratings made of 
the candidates on this quality by 
their own departmental professors 
(Barron, 1954). All other things be- 
ing equal, the best assessors for pre- 
dicting existing criteria are those who 
are partially contaminated with the 
same experience, standards, and out- 
look as the criterion raters and can 
thus rely on a global strategy to make 
their predictions. (The most ac- 
curate assessors are also more ac- 
curate than the most accurate, cross- 
validated, tests.) 

The validity of analytic methods 
is subject to the accuracy of the per- 
sonality theory which the assessor 











uses, but psychologists usually pos- 
sess fairly stable postulates, based 
on the lore of their discipline rather 
than behavior-oriented empirical re- 
search, and these are not readily 
changed in the light of actual em- 
pirical data. This is probably the 
reason why some of the Menninger 
assessors did not improve their ac- 
curacy with the help of the em- 
pirically-based ‘‘manual.’”’ The dif- 
ficulty can be seen clearly if we con- 
sider the findings of the Minnesota 
starvation studies (Kjenaas & Bro- 
zek, 1952) that the Rorschach in- 
dices of adjustment had a negative 
validity in predicting ratings of the 
subject’s adjustment after starva- 
tion. Could a typical clinical psychol- 
ogist bring himself to reverse com- 
pletely his normal interpretation of 
the Rorschach in order to predict the 
subject’s adjustment under the cri- 
terion conditions? Not unless he 
were able to find an intervening vari- 
able between the Rorschach and the 
criterion that would enable him to 
understand the connection within the 
framework of his existing theory of 
personality. 

Our discussion of clinical versus 
statistical methods of assessment has 
concentrated on one aspect of the 
procedure, the  prediction-making 
stage. The contrast between these 
two approaches can be made in con- 
nection with a whole chain of deci- 
sions that must be made in the 
course of assessment: these decisions 
include determining the acceptable 
criteria, scoring the criterion be- 
havior, conducting the criterion anal- 
ysis, determining the form of the 
tests and standard situations, observ- 
ing and classifying the assessment 
behavior (i.e., scoring), combining 
the observations made by any single 
assessor into an assessment or pre- 
diction and combining the predic- 


MULTIPLE METHODS OF PERSONALITY ASSESSMENT 345 


tions made by different assessors. 
For example, should the individual 
assessments be combined  subjec- 
tively by the chairman of an assess- 
ment board, by voting, or by averag- 
ing the individual predictions? In- 
sufficient attention has been given to 
the relative merits of subjective and 
objective methods at each one of 
these stages. 

The choice of method will depend 
on both the requirements and the 
over-all situation, including, some- 
times, public relations considerations. 
The final selection of assessment 
techniques is likely to be a mixture 
of both subjective and objective, but 
the circumstances that will favor 
one or the other at any stage are 
rather vague, and the choice is 
usually made on subjective grounds, 
although it, too, could be made on 
the basis of objective, empirical in- 
vestigation. In general, objective 
methods are to be preferred as far 
as possible as they maximize accu- 
racy, but practical considerations of 
economy, convenience, and the limi- 
tations of the situation, dictate the 
wholesale use of subjective methods 
in personality assessment. These 
subjective methods may have high 
validity under favorable circum- 
stances, and where the assessors are 
familiar with the criterion situation, 
clinical judgments may actually be 
more accurate than any objective 
methods are ever likely to be in 
predicting to criteria. 


Conditional Variables in the Criterion 


An old problem in evaluating the 
validity of prediction is set by varia- 
tions in the criterion situation attrib- 
utable to the surrounding conditions. 
For example, a prediction that a 
candidate will make a good officer 
may be invalidated through some 
contingency such as being posted to 
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a commanding officer with whom he 
is incompatible. But these condi- 
tional factors do not stand on their 
own; there is an interaction between 
the person and the condition. Thus, 
Officer A may have the type of per- 
sonality (or background) that makes 
it likely that he will be posted to a 
commanding officer with whom he 
will be incompatible; if, for instance, 
Candidate A is Jewish, he is more 
likely to have a CO who behaves un- 
congenially than is another candidate 
of a similar personality who is not 
Jewish. Further, Officer A may per- 
form his duties better than otherwise 
when he has an uncongenial CO, 
while Officer B may perform his 
duties worse under the same circum- 
stances. In most assessments, no 
specific reference is made to such con- 
ditional factors and there is an im- 
plicit assumption of “given normal 
conditions” attached to the predic- 
tions. The OSS reports a validity of 
only 0.19 for all cases from Station S 
compared with 0.39 for only the cases 
who were given assignments that 
were consistent with the ones for 
which they were assessed. 

A further condition that is often 
ignored in assessment is that of 
effluxion of time; the predictions are 
usually made on the assumption that 
the status of the candidate on the 
relevant variables will remain con- 
stant over time. At a more sophis- 
ticated level, trends towards change 
may be observed in the candidate 
together with potential but as yet un- 
realized capacities, and the assess- 
ment may extrapolate these into the 
future. But it is virtually impossible 
to take into account subsequent 
learning, maturation and deteriora- 
tion in the assessment prediction. 

In this connection, Cronbach and 
Gleser (1957) have proposed a useful 
distinction between fixed treatment 


(the same conditions for all success- 
ful candidates) and adaptive treat- 
ments varying according to the can- 
didate. Evidently the treatment of 
the OSS selectees was fixed rather 
than adaptive, and the predictions 
should have taken this into account. 
Five different types of solutions are 
suggested below for the problem of 
conditional factors in these treat- 
ments, Solutions 1 and 3 being par- 
ticularly appropriate to fixed treat- 
ments, and the other three to adap- 
tive. (These represent an expansion 
of the three solutions proposed in 
Horst, 1941, ch. 5.) 


1. Adjust the criterion ratings ex post facto 
according to the ease or difficulty presented to 
the candidate by the criterion conditions and 
the effects of these conditions on him over the 
relevant period of time. This adjustment re- 
quires an intuitive judgment that takes into 
account the interaction between the condi- 
tions and the candidate, and this can be done 
only by the rater making a further, independ- 
ent assessment of the candidate. For the 
validation to carry conviction, it is necessary 
that the adjustment to the criterion rating be 
made independently of the assessment. 

2. Make the predictions to the ideal pos- 
sible conditions so that they represent the 
candidate's fullest potential; the criterion 
ratings can then be made in accordance with 
the same standards. In other words, both 
assessment and prediction attempt to hold 
conditions constant in the form that is con- 
sidered to be optimal for the candidate's per- 
formance. The actual conditions applying 
at the time of assessment and during the cri- 
terion performance are unlikely to be optimal 
no matter how hard this state is sought, so 
that the use of this solution rests very heavily 
on intuition. 

3. Predict to the average or modal condi- 


‘tions that have prevailed in the past with re- 


spect to the criterion situation, or which are 
expected to prevail in the future. This is the 
usual orientation in assessments based on em- 
pirical validation since the correlations on 
which the validities are based are in effect 
averages. The empirical strategy automati- 
cally takes into account the variations in con- 
ditions as well as their average effect and 
maximizes the prediction to these average 
conditions. The conditions, thus, influence 
the assessment only through their effects on 
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the criterion performance, without regard to 
their specific nature. It is practically impos- 
sible for a clinician to average all possible 
relevant conditions by an intuitive act, al- 
though it is common for a clinician to bear in 
mind the modal conditions which candidates 
face when these are prominent. 

4. The future conditions may be predicted 
specifically for each candidate so that the in- 
teraction between the candidate and the cri- 
terion conditions may be anticipated in the 
assessment. The prediction to the future 
conditions may be made on the basis of inside 
knowledge of the treatment to be given to the 
candidates in the criterion situation, or by 
forecasting on general grounds the sp: cific 
changes that will occur in the conditions be- 
fore the criterion ratings are made. Such pre- 
dictions must be intuitive rather than em- 
pirical, and, by the nature of the complexity 
of man’s environment, all such intuitive pre- 
dictions must fall well short of perfect valid- 
ity. In some complex situations, in which the 
criterion performance is highly dependent on 
the conditions, the inability of the assessors 
to predict the specific conditions that will 
operate for any particular candidate may 
render the assessments completely invalid. 

5. The predictions themselves can be made 
in terms of specific conditions: “if X condi- 
tions occur, then the candidate will be success- 
ful.”” In this endeavor, the recent proposal by 
Cattell (1957, pp. 426ff.) for a taxdbnomy of 
situations might eventually supply a list of 
standard situations to be considered in condi- 
tional predictions. 


Solutions 4 and 5 are both specific 
conditional solutions which can take 
into account the effects of conditions 
that are external to the candidate, as 
well as intrinsic conditions such as 
maturation. They require both a 
knowledge of the criteria require- 
ments and a correct assessment of 
the candidate, but the first type of 
conditional prediction emphasizes the 
criterion situation, and the second, 
the candidate. Both of these latter 
methods of meeting the problem 
of conditions are adaptable to tak- 
ing into account multiple condi- 
tions and also ‘‘adaptive treatments”’ 
such as provisions for training that 
are tailor-made for the candidate. 
They hold out the possibility of mak- 
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ing more exact predictions than can 
be made by the other three attempted 
solutions to the problem. This is one 
of the reasons why the global strat- 
egy, or slightly analytic versions of 
it, have been so often favored in se- 
lection assessment programs. But 
these conditional predictions are also 
the most difficult to make, and only 
the best judges of personality or the 
ones who are most experienced with 
the criteria conditions are able to 
make them accurately, and then only 
in appropriate situations. 

The decision as to the appropriate 
solution to the problem of varying 
conditions is closely related to the 
choice of strategy. In the long run 
the choice is one between elegance 
and the practical limitations that are 
imposed on the possibilities of accu- 
racy. 


The Assumption of Safety in Numbers 


Personality assessment programs 
rely on numbers to improve their 
validity in two directions: multiple 
tests and multiple assessments. We 
shall treat the evidence concerning 
these two points separately. 

Multiple tests. Where the tests and 
other assessment measures are com- 
bined objectively, for example, in 
accordance with a multiple regression 
equation, even the most valid test 
can usually be improved upon by 
adding one or two further measures 
to it. It is often striking, however, 
how quickly the multiple Rs reach 
their ceiling; the common compo- 
nents of almost all available person- 
ality measures seem to be so high 
that we quickly exhaust the new ele- 
ments that additional tests can bring 
to the predictions. The same applies 
when the combining of elements is 
carried out intuitively; even though 
the clinician may believe that the 
pieces of information about a candi- 
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date are independent of each other. 
It is doubtful whether a clinician can 
use more than a few pieces of data 
that are relatively independent, even 
if they can be found. Sarbin (1942), 
for instance, demonstrated that clini- 
cians who were given a mass of data 
from which to predict the success of 
university students, gave most of the 
weight to two variables only. 

Evidently, to give a clinician more 
than two or three pieces of data about 
an assessee is likely to be of little 
value. Some critics go even further, 
claiming that giving extra data ac- 
tually reduces validities by confusing 
the allocation of subjective weights 
to the predictor variables, and by in- 
creasing the variability of the predic- 
tions, i.e., inducing the clinician to 
venture into making extreme judg- 
ments which increase the risk of mak- 
ing large errors. Kelly and Fiske 
claim (1950) that in the Michigan 
study validities declined as more 
data were given to the assessors. Holt 
challenges the accuracy of their inter- 
pretation of the findings (1958, p. 8), 
but even so there are other studies 
that suggest that more data do not al- 
ways improve accuracy (e.g., Gage, 
1953; Giedt, 1955; Kostlan, 1954; 
Soskin, 1954). In Giedt’s study, for 
instance, the clinicians were able to 
make more valid predictions of 
mental patient’s personalities from 
sound recordings than from sound 
movies. 

But there are several studies af- 
firming that, at least under some cir- 
cumstances, more data do enable 
clinicians to improve their accuracy. 
We have already referred to Vernon’s 
report (1950) that in the CISSB selec- 
tions the Board of Review was able to 
improve on the assessment board’s 
recommendations by combining these 
with their own interview impressions 
of the candidate. Increased validity 
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with increased data is also reported 
for the California (Mackinnon, 1951), 
Chicago (Stern, Steii, & Bloom 
1956), and Menninger (Holt, 1958) 
assessments. 

We must suspend our verdict on 
the value of multiple data at this 
stage. Evidently there are circum- 
stances that can overcome the limita- 
tions on the ability of a single assessor 
to hold in mind data and to combine 
them. One suggestion worth testing 
is to combine data into subdecisions 
of an increasing degree of molarity, 
until the final molar decision is 
reached. This procedure can assist 
the clinician to consider all of the 
data in reaching his final decision 
and it is analogous to the use of 
structured schedules and rating forms 
that are used by interviewers to con- 
solidate portions of the data as they 
go along. This technique as a general 
aid to clinical judgments seems worth 
experimenting with, although the 
danger must be avoided of giving too 
much weight to the data that are 
presented first. In this respect it 
would seem to be wise to seek out 
first the data that are believed to be 
the most valid. 

Another way of handling the com- 
bining of data is to use several asses- 
sors, each responsible for one or two 
different techniques or areas of per- 
sonality. This was the method 
adopted, for example, in the CISSB 
assessments. This proposal carries 
over to the general question of using 
multiple assessors and we shall con- 
sider it further below. 

Multiple assessors. The practice of 
using more than one assessor in selec- 
tion work is an old one; the assump- 
tion has been that the more assessors 
there are, the more ideas will be 
thrown into the pool and therefore 
the more thorough will be the mar- 
shalling of data. Where ratings are 
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pooled, it is also hoped that errors 
will cancel each other out. Very little 
experimental material is available on 
the relative value of group versus in- 
dividual judgments in personality 
assessments, but evidence can be 
used from other work on other types 
of group performance (see Kelly & 
Thibaut, 1954; Klein, 1956, ch. 1; 
Argyle, 1957, ch. 5). 

These findings suggest, among 
other things, that accuracy of judg- 
ments increases with the size of the 
group, but the optimum number in 
informal problem-solving groups is 
possibly five, since larger groups re- 
quire formal structuring in order to 
ensure adequate communication of 
information; that compatible mem- 
bership is important in problem-solv- 
ing committees; that democratic 
groups produce more different ideas 
than individuals but fewer per per- 
son; that the quality of group deci- 
sions increases with an increase in the 
skill of the members; that groups are 
quicker at solving problems than in- 
dividuals, although less economical 
in terms of man-minutes. However, 
these findings vary according to the 
type of task concerned, and before 
we can carry them over to personality 
assessment it is necessary to bear the 
type of task in mind. 

Some of the questions that should 
be asked concerning group factors in 
personality assessment are: are group 
ratings more accurate than those of 
the individual members of the group; 
does group discussion by the assessors 
improve accuracy over pooled indi- 
vidual ratings; what is the relative 
value of means, modes, and medians 
as methods of pooling; the ideal size 
of committees; committee ratings 
versus averaging; authoritarian lead- 
ership of assessment committees ver- 
sus democratic; should all of the com- 
mittee members be given the same 
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data; should both the observations 
and the interpretation be made by 
groups? These questions can be con- 
sidered at three points in the assess- 
ment procedures: (a) in making sub- 
jective observations of the subjects; 
(b) in eliciting data from the subjects; 
and (c) in integrating and interpret- 
ing the data, and making the deci- 
sion. 

(a) At the observational level, we 
should expect that the pooled ratings 
of several observers would be more 
accurate than individual ratings, 
since pooling reduces the error vari- 
ance, provided always that the indi- 
vidual judgments have some validity 
in the first place (cf. Kelley & Thi- 
baut, 1954, p. 739). 

(b) The value of the group inter- 
view versus individual interviews asa 
means of eliciting data is equivocal 
(see the discusssion in Oldfield, 1947). 
A recent study (Glaser, Schwarz, & 
Flanagan, 1958) on the selection of 
supervisors found that individual rat- 
ings based on group interviews by a 
panel of three were no more accurate 
than the ratings made by one inter- 
viewer per candidate. While it is true 
that the group situation may elicit a 
wider sample of behavior than an in- 
dividual interview, it is more difficult 
for an interviewer to evaluate the sig- 
nificance of the group as the stimulus 
to which the candidate is responding. 
However, if the interviewing is con- 
ducted by the chairmen only, while 
the other assessors are simply ob- 
servers, this may enable the assessors 
to make more unbiased judgments 
than when they are actually involved 
in the interviewing. This effect still 
remains to be tested empirically. 

(c) The usual consideration of the 
value of group assessment versus in- 
dividual deals with the integration of 
the available data. As in the case of 
group observations, pooled predic- 
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tions are more accurate than most or 
all of the individual predictions 
(Klugman, 1947; Luborsky & Holt, 
1957; Travers, 1941). In one study 
(Smith, 1932) of assessing the qual- 
ities of a child on the basis of behav- 
ioral data, the accuracy increased 
with an increase up to 50 of the num- 
ber of assessors whose ratings were 
pooled (there were only 50 assessors 
available). 

Does discussion prior to assess- 
ment increase accuracy? The evi- 
dence on this suggests that it does not 
(Rusmore, 1944; Taylor, 1947; Kelly 
& Fiske, 1951; Oldfield, 1947). As 
Oldfield puts it: ‘Discussion of the 
merits of candidates merely amounts 
to a somewhat clumsy method of 
averaging the individual judgments 
of the members’ (1947, p. 129). 
Whether discussion aids accuracy or 
not appears to depend on the quality 
of the persons who dominate the dis- 
cussion either through their position 
in the group, their personality, or 
their professional standing. Discus- 
sion is justified particularly when 
there is an ‘‘expert”’ as chairman, who 
will actually make the final decision 
in an autocratic manner, but who 
calls on the other members of the 
panel to give him the benefit of their 
opinions. An “‘expert”’ is defined, for 
this purpose, as a person who is ex- 
perienced both in assessment and in 
the criterion situations. 

Kelly and Fiske are quite pessimis- 
tic regarding the use of multiple as- 
sessors. ‘Until some of the major 
sources of error in predictions are 
eliminated, the replications of asses- 
sors and the use of staff conferences 
hardly seems justified for this type of 
prediction” (1951, p. 178). This con- 
clusion is too sweeping. As we can 
see from Table 1, both pooled and 
committee (discussion) ratings have 
justified themselves in some studies. 
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Let us conclude this section on the 
“safety in numbers’ assumption 
with a proposal to combine the ad- 
vantages of both multiple techniques 
and multiple assessors. The sugges- 
tion is that each assessor be given a 
limited amount of information on 
which to base his assessment judg- 
ments about the candidates, each 
assessor to receive different informa- 
tion. The assessments will then be 
pooled arithmetically. The informa- 
tion supplied may be objective or 
subjective, atomistic or molar, and 
may range from one item of life- 
history, or a test result, to a projec- 
tive test protocol, an interview or the 
observation of behavior in a minia- 
ture situation. This procedure would 
enable a vast amount of data to be 
integrated without problems of 
weighting since unit weights for each 
assessor's contribution would be ade- 
quate—this would be analogous to an 
inventory that gives unit weight to 
each item. With adequate organiza- 
tion of the assessment program, this 
would permit several assessors to 
contribute to the final assessment so 
that different viewpoints and _ per- 
sonality theories can be represented. 
This approach seems to be at least 
worth experimenting with. 

Even if it is found that increased 
numbers of assessors increases per- 
ceptibility the accuracy of the assess- 
ments, there is still a fine calculus of 
cost in human time and effort to be 
computed. The decision to augment 
the panel with additional assessors 
is a function, among other things, of 
the gradient of diminishing returns, 
the ability of available extra asses- 
sors, the cost of using them, the ef- 
fects on the candidates, and the desire 
to allow executives in the institution 
to participate in the assessment. The 
proposal made above of having many 
assessors, who contribute small pieces 
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of information, may make it possible 
to conduct multiple assessments com- 
paratively cheaply. 


SUMMARY 


Multiple personality assessment 
procedures have been analyzed with 
respect to their primary purpose and 
the validation strategy used. Prob- 
lems that arise in the attempt to use 
personality assessment for selection 
were discussed with respect to the 
problem of clinical versus statistical 
predictions, the problem of condi- 
tional factors that affect the criteria, 
and the value of using multiple tests 
and more than one assessor. 

Some recommendations: 

1. Use objective techniques as far 
as possible for analyzing the criterion, 
for scoring tests, and for making pre- 
dictions. 

2. Give careful consideration to re- 
quirements of the criterion and make 
empirical studies of the link between 
these requirements and both the test 
behavior and the criterion behavior. 
This is a step in construct validation. 

3. As a preliminary to the above 
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type of criterion analysis, nonmediate 
empirical or clinical methods of pre- 
diction may be used. 

4. Subjective observations should 
be made by several observers whose 
opinions should be pooled arithmeti- 
cally. 

5. The assessors should be selected 
for proven ability to make accurate 
judgments in the assessment situa- 
tion, i.e., they should be validated. 

6. The assessors should be familiar 
with the criterion situation, and 
should take this situation into ac- 
count when they make the predic- 
tions. 

7. Each assessor should be given 
no more than two or three units of 
information; there should be a large 
number of assessors whose predic- 
tions are pooled arithmetically, and 
without discussion. 

8. In selection assessments, if com- 
mittee decisions are desired, the 
assessors who are particularly well- 
experienced in the criterion situation 
should be given special influence in 
forming the final decisions, provided 
they have been shown to possess good 
ability to judge persons. 
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The word “‘projection”’ stems from 
the Latin verb projectus, meaning ‘‘to 
cast forward” (Bell, 1948). In the 
field of personality one would be hard 
pressed to find a concept so capable 
of multiple interpretation and so 
varied in meaning as the concept of 
projection. Surely, this concept has 
had more interpretations than the 
smile of Mona Lisa. Sears described 
the situation thus: ‘‘Probably the 
most inadequately defined term in all 
psychoanalytic theory is projection”’ 
(Sears, 1943, p. 121). Murray has 
said “If ‘projection’ means every- 
thing it means nothing’ (Murray, 
1951, p. 13). 

Just how many kinds of projection 
are there? It is frankly impossible to 
describe them all. The best one may 
hope for is to group them into more 
or less distinct This 
paper, then, will concern itself with 
attempting to describe 
“types” of projection, their origins, 
and the research undertaken with re- 
gard to them. 

Only the various kinds of projec- 
tion subjected to experimental in- 
vestigation will be dealt with in de- 
tail. Because they have resulted in 
little research, the conceptual frame- 
works of Bellak (1950, 1954, 1956), 
Murray (1938, 1951), Van Lennep 
(1951, 1957), Rapaport (1942, 1945, 
1952), Schachtel (1950), and 
(1957) will not be considered. 

The research literature seems to 
indicate some four possible cate- 
gorizations of the concept of projec- 
tion: ‘‘classical,”’ ‘‘attributive,’”’ ‘‘au- 


categories. 


various 


Goss 


tistic,”’ and ‘‘rationalized’’ projec- 
tion. Under the category ‘“miscel- 
laneous’’ we will review those studies 
in which projection was undefined, or 
in which several different concepts 
were simultaneously investigated. 
Classical Projection 

The concept of projection was 
known several centuries before the 
appearance of Freud. Thus Thomas 
a Kempis stated, ‘‘What a man is in- 
wardly that he will see outwardly” 
(Cattell, 1951). The Malleus Male- 


ficarum, written in the Middle Ages, 
gives a clear example of projection. 


For fancy or imagination is as it were the 
treasury of ideas received through the senses. 
And through this it happens that devils so 
stir up the inner perceptions, that is the power 
of conserving images, that they appear to bea 
new impression at that moment received from 
exterior things (Zilboorg, 1935, p. 54). 


At the close of the 19th century, 
Freud gave the following definition 
of projection: ‘‘The psyche develops 
the neurosis of anxiety when it feels 
itself unequal to the task of mastering 
(sexual) excitation arising endoge- 
nously. That is to say it acts as if it 
had projected this excitation into the 
outer world” (Bellak, 1944, p. 353). 
With this definition, the use of the 
concept soon became quite wide- 
spread. 

The view of “classical” projection 
most often held currently by many 
psychologists is: “A situation in 
which the ego feels threatened is 
likely to result in the ego’s refusing to 
acknowledge the trait and in the sub- 
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sequent attribution of the trait to 
the outside world’”’ (Murstein, 1956, 
p. 418). Adherents to this view are 
many, though they differ as to the 
wording used. Only a representative 
sampling is listed (Healy, 1930; Hoff- 
man, 1935; Jelgersma, 1926; Kauf- 
man, 1934; Knight, 1940; Schafer, 
1954; Schaffer, 1945; Symonds, 1949; 
Warren, 1934). The possibility of the 
projection of objectively favorable 
traits has been mentioned by Muhl 
(1943), Hoop (1924), and Janet 
(1947). 


Attributive Projection 


“Attributive projection” has been 
described by many psychologists, in- 
cluding Cameron (1951). <A recent 
definition is: ‘‘The ascribing of one’s 
own motivations, feelings, and be- 
havior to other persons’ (Murstein, 
1957b). It is perhaps the most popu- 
lar of the uses of projection today in 
the field of personality. The con- 
cept’s popularity rests on its broad- 
ness; i.e., unlike “‘classical’’ projec- 
tion there is no concern with the S’s 
unconscious, or self-concept. It is 
often sufficient merely to note that 
there is a correlation between some 
characteristic of the subject and some 
statement or prediction he makes 
concerning other persons. Weiss 
points out that ‘‘the term projection 
in current usage, refers to every kind 
of externalization, particularly to 
every process in which ideas, im- 
pulses, or qualities belonging to one- 
self are imputed to others” (1947, p. 
358). Nevertheless, the concept has 
strong support (Cameron: 1947, 1951; 
Dymond, 1950; Munn, 1946). Even 
Freud, in Zotem and Taboo said (as 
noted earlier by Bellak, 1956): 

But projection was not created for the pur- 
pose of defense; it also occurs where there is no 
conflict. The projection outwards of internal 
perceptions is a primitive mechanism, to 
which, for instance, our sense perceptions are 
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subject, and which therefore normally plays 
a very large part in determining the form 
taken by our external world. Under condi- 
tions whose nature has not yet been suff- 
ciently established, internal perceptions of 
emotional and thought processes can be pro- 
jected outwards in the same way as sense per- 
ceptions, they are thus employed for building 
up the external world, though they should by 
rights remain part of the internal world 
(Freud, 1955, p. 64). 


Freud's statements are, thus, 
amenable to an attributive definition 
of projection while comprehending 
projection as an organizing aspect of 
perception. 

Horney also described a nondefen- 
sive kind of projection which, in part, 
is ‘‘not essentially different from the 
tendency to assume naively that 
others feel or react in the same man- 
ner as we ourselves do”’ (1939, p. 26). 
An example of ‘‘naive’’ projection is 
mentioned by Baldwin (1955) in 
describing the child who in a fit of 
anger threatens to maim, murder, 
and demolish another child, and as a 
crowning imprecation, threatens not 
to invite his adversary to his birth- 
day party. 


Autistic Projection 


Perception which is strongly influ- 
enced by the needs of an individual, 
in that the figural aspects of the per- 
ceived object are modified so as to be 
consistent with the need, may be re- 
ferred to as ‘‘autistic’’ projection. 
Murphy (1947, pp. 338 ff.) wrote ‘‘so 
wherever our needs differ we literally 
see differently. Much of the process 
of individual perception depends 
upon the force of past wants, the per- 
son's need to disentangle and restruc- 
ture in terms of the situations with 
which he has had to cope.” Sears 


(1943, p. 121) has said, ‘it may be 
said in general, that the presence of a 
need or drive provides the antecedent 
condition for the perception of ob- 
jects related to that need or drive.” 
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Rattonalized Projection 


This type of projection is similar 
to ‘classical’ projection in that the 
projective process is held to be un- 
conscious. The projector, however, is 
conscious of his behavior. He at- 
tempts to justify it by inventing a 
rationale. Thus, the person caught 
buying on the “black market” says 
in self-justification, ‘‘everybody else 
is doing it.’’ Here, the attempt is to 
convert neurotic anxiety about doing 
something wrong into objective anxi- 
ety about not getting enough to eat 
(Murstein: 1956, 1957b). Baldwin 
(1955) states: 

Finding a child disobedient and unloving 
justifies resentment. This introduces us to 
another sort of defense against guilt—namely 
to justify the rejection so that it no longer 
evokes guilt feelings. This attribution of 
blame to the people toward whom we feel 


hostile is a defense mechanism called ‘‘projec- 
tion’”’ (Baldwin, 1955, p. 498). 


Among other supporters of this 
kind of projection are Allport (1939), 
(complementary projection), Fen- 
ichel (1945), The Psychiatric Dic- 
tionary (1940), and Piaget (1926), 
(projection de réciproque). Van Len- 
nep (1951) has referred to this oc- 
currence not as projection, but as its 
correlate. Thus, if a person is fright- 
ened, in order to project, he would 
have to see the object as frightened. If 
he sees the other as the cause of his 
fright and thus frightening, he is per- 
ceiving the correlate of his fright, but 
he is not projecting. Again, however, 
we find Freud ‘in Totem and Taboo 
giving plausibility to a rationalizing 
kind of projection. He states: 

It cannot be disputed that this process of 
projection, which turns a dead man into a 
malignant enemy, is able to find support in 
any real acts of hostility on his part that may 
be recollected and felt as a grudge against 
him: his severity, his love of power, his un- 
fairness, or whatever else may form the back- 


ground of even the tenderest of human rela- 
tionships (1955, p. 63). 
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At this point it may be of value to 
examine the experimental findings 
with regard to the aforementioned 
kinds of projection. 


EXPERIMENTAL FINDINGS 
Classical Projection 


Sears (1936) was among the first of 
the psychologists to utilize a quanti- 
tative index of projection. He studied 
male college students’ possession of 
the traits of “stinginess,” ‘‘obstin- 
acy,” ‘“‘disorderliness,’’ and ‘‘bash- 
fulness,’’ by the pooled rating method. 
Projection was measured as follows: 

(1) The degree to which each individual 
demonstrated a given trait was determined by 
averaging the combined ratings assigned to 


him on that trait by other members of his 
house. (2) The amount of a given trait at- 


tributed to others was obtained by averaging 
the ratings a given individual assigned to the 
other members (Sears, 1936, p. 153). 


Projection would have occurred if, 
for example, stingy, noninsightful 
persons saw more stinginess in others 
than did the total group as a whole. 
The results, however, did not support 
a Freudian (‘classical’) concept of 
projection, in that projection oc- 
curred for both acceptable and non- 
acceptable (reprehensible) _ traits. 
Moreover, in the group possessing in- 
sight, a negative correlation was 
found between the strength of the 
trait and the amount attributed to 
others. Sears called this occurrence 
“contrast formation.” Its effect was 
said to be opposite to that of projec- 
tion. 

One might have wished that “‘pro- 
jection” and “‘insight’”’ had not been 
treated dichotomously. The crude- 
ness of measure due to the lack of a 
specific quantitative score other than 
“projected” or “did not project” 
defies any attempt at extensive anal- 
ysis. Further, the appearance of a 
spurious element in the data, a fact 
noted by Rokeach (1945), and Calvin 
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and Holtzman (1953), places Sears’ 
findings in doubt. The difficulty lies 
in the fact that projection as defined 
by Sears was a function of the group 
rating (G), and the difference be- 
tween the group- and self-ratings 
(G-S). Since G appeared in both (G) 
and (G-S), a spurious correlation 
would be expected between (G) and 
(G-S), even if no actual psychologi- 
cal relationship existed. Hence, the 
elect of G should have been par- 
tialled out, something which was not 
done. 

Zucker (1952) measured projection 
by summing the number of items in 
which an individual (college student) 
said that other people behaved in a 
certain way, but he, the student, did 
not. The approach seems question- 
able. No mention was made of the 
actual possession by the Ss of the 
trait in question. Hence, deviates 
from the population norm would have 
been considered as projectors through 
the use of this methodology. Under 
these circumstances Zucker’s findings 
that “high projectors’’ had more 
ideas of reference and were more 
ascendant on the Allport Ascendant- 
Submissive scale than “low  pro- 
jectors,’’ seems meaningless. 

Zimmer (1955) presented students 
with three photos and had them 
choose the ones liked best and least. 
The selected photos were then rated 
on a 7-point scale as to possession of 
some 25 traits. The Ss then rated 
themselves on a scale containing the 
same traits. Finally, measures of 
conflict with regard to each of these 
traits were obtained from a word 
association test. 

Zimmer’s hypotheses, that accept- 
able personality characteristics are 
projected onto liked individuals, and 
unacceptable characteristics are pro- 
jected onto disliked persons, and that 
the strength of projection is a func- 
tion of the degree of conflict, were all 
substantiated. Similarly, Lundy and 
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Berkowitz (1957) found that students 
whose attitudes were influenced by 
peers tended to perceive themselves 
as more similar to these peers than 
students who were negative to peer 
influence. 

Norman, in a series of experiments 
with co-workers Ainsworth (1954), 
and Leiding (1956), investigated the 
relationship of such variables as “‘pro- 
jection,” “empathy,” and “reality,” 
and found the correlations given in 
Table 1. 

These results apparently indicate 
that the various kinds of projection 
are detrimental to the accurate per- 
ception of others, but they have been 
shown to be an artifact of the pro- 
cedure by Cronbach (1955), Gage 
and Cronbach (1955), and Murstein 
(1957a, 1957b). The criticism by 
Murstein (1957a) of their procedure 
is based on their definitions of pro- 
jection, and is as follows: 

Norman and Ainsworth Projection: 
“A” says that he does not possess a 
certain trait (he answers ‘‘no”’ to one 
of the questions on the GAMIN). 
“A” says that other students of his 
age and sex do possess that trait. Of 
the remaining members of the group 
(college students) from whence “A” 
stems, 51% or more say that they do 
not possess the trait in question. 

Norman and Leiding, Projection A: 
This measure of projection may be 
more readily understood by means of 
an alphabetical shorthand used to 
describe each step (a, b, c, d, -- -). 

(a) “A” rates ‘B” as he thinks “B’’ would 

rate himself, 

(b) “A” rates himself (‘‘A’’) as he thinks 

“B” would rate him. 

(c) “A” rates “B” as he (“A”) sees “B.” 

(d) ‘‘A”’ rates himself (‘A’’). 

Projection = (a—c) plus (d—b) 


Norman and Leiding, Projection B: 


(e) “A” says most other people will answer 
a given question in a certain way. 

(f) “A” answers in the same way in judging 
himself (“‘A’’) as he predicted that most 

A “pro- 


other people would answer. 
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rABLE 1 


INTERCORRELATIONS OF “PROJECTION,” “EMPATHY,” AND “REALITY” TAKEN 
FROM THE NORMAN AND AINSWORTH (1954) AND 
NORMAN AND LEIDING (1956) StuDIES 


Authors Variables 





Norman and Ainsworth 
Norman and Ainsworth 
Norman and Leiding 
Norman and Leiding 
Norman and Leiding 





Projection vs. Reality 

Projection vs. Empathy 
Projection A vs. Refined Empathy 
Projection B vs. Raw Empathy 


| 
Projection B vs. Refined Empathy | .69 





jection” point is achieved each time e 
and f occur together for the same ques- 
tion. Thus, Projection =X(e, f). 

The projection score of Norman 
and Ainsworth may be objected to 
(a) because of the use of the subject’s 
self-rating as an objective criterion of 
whether or not that subject possessed 
a given trait. This appears hazard- 
ous, particularly in a study measur- 
ing projection. Since the criterion 
for the group was 51% or more, if 
only 2% of the Ss projected, the item 
would have been placed incorrectly 
and thus distorted the measure. of 
projection; (b) because of the absence 
of any reliability coefficients for the 
various judgments; (c) because some 
of the traits on the GAMIN have 
moderately high intercorrelations, 
thus lending a degree of spuriousness 
to the results; (d) because in correlat- 
ing ‘Projection’ and ‘‘Empathy” 
there is a common component (others 
say they possess the trait) in both 
variables which spuriously inflates 
the resulting correlation. 

Norman and Leiding used 
other measures of projection which 
have been labeled by the present 
authors as Projection A and B. Using 
the aforementioned alphabetical 
shorthand for operations, their cor- 
relation of Projection with Refined 
Empathy (Raw Empathy minus Pro- 
jection), for which a correlation of 
—.47 was reported, may be described 
as follows: 


two 


Similarly for Projection B, the 
correlation of Projection with Raw 
Empathy (r=.86) may be described 
as follows: 


Projection 


Raw Empathy 
—_ vw 
Z(e, f) L(e, 1) 

We have omitted the descriptions of 
g, h, and 7 in the interest of space 
since they are not crucial to our dis- 
cussion. 

In the first of these two correla- 
tions, the position of common com- 
ponents is such as to insure the fact 
that as the projection score increases, 
the refined empathy score must de- 
crease due to the common 
ponent [(a—c)+(d—b)}]. Hence, a 
spurious negative correlation is quite 
expected. In the second of the cor- 
relations, there is again an occurrence 
of identical components, but, here, 
since no component is subtracted, 
the spurious correlation which re- 
sults is positive. 

Murstein (1956), by means of 
pooled ranks, selected four personal- 
ity groupings (hostile-insightful, hos- 
tile-noninsightful, friendly-insightful, 
and friendly-noninsightful). Using 
the Rorschach Hostility Scale as a 
measure of the hostile content of the 
Rorschach, he found that hostile, 
insightful, people projected more hos- 
tility than any other grouping. Ina 
dynamic ego-threatening situation, 
however, the hostile, noninsightful, 


com- 


Projection vs. Refined Empathy (Raw Empathy — Projection) 
[((a —c) +(d — b)] vs. [(a — g) + (b —h)] — [(a —c) + (d — )] 
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group manifested ‘classical projec- 
tion’’ as expected, but, surprisingly, 
the friendly, insightful, group also re- 
acted strongly to threat, by distort- 
ing the examiner's behavior in per- 
ceiving him as hostile. Thus, the re- 
sults did not wholly jibe with a 
“classical” conception of projection 
but were more amenable to analysis 
in a phenomenological frame of refer- 
ence. 

Both the hostile, noninsightful, 
and friendly, insightful, groups per- 
ceived themselves as friendly. Ob- 
jectively speaking, the members of 
the friendly, insightful, group were 
correct, the members of the hostile, 
noninsightful, group were in error. 
What was important, however, was 
the way in which each individual per- 
ceived himself. The experimental 
findings are consistent with the belief 
of Lecky, that “any value . . . which 
is inconsistent with the individual’s 
valuation of himself cannot be assim- 
ilated; it meets with resistance and is 
likely, unless a general reorganization 
takes place, to be rejected’”’ (Lecky, 
1951, p. 153). 

In a quite similar design, Page and 
Markowitz (1956) obtained a de- 
fensive and nondefensive population 
from their responses to MMPI items. 
Again, a test of defensiveness based 
on receiving critical comments with 
regard to performance on a pseudo 
‘intelligence test’? did not signifi- 
cantly differentiate the defensive and 
nondefensive persons. Under per- 
sonal ego-threat, the defensive per- 
sons projected more hostility on the 
examiner than did the nondefensive 
persons. The nondefensive persons 
did, however, show a nonsignificant 
tendency toward projection. 

These two experiments seem to 
support the concept of ‘‘classical’’ 
projection in indicating that defen- 
sive persons do project under ego- 
threatening situations, though they 
may not do so on projective and 
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paper-and-pencil tests. The failure 
of Page and Markowitz to find sig- 
nificant projection on the part of the 
nondefensive group may have been 
due to their different method of meas- 
uring defensiveness, as well as to the 
fact that their group consisted mainly 
of women while Murstein’s consisted 
of men. Lastly, the sex of their ex- 
aminer was not indicated, and may 
have been a woman, while in Mur- 
stein’s study the examiner was a 
man. It seems quite plausible to be- 
lieve that the sex of the subject, as 
well as that of the examiner, influ- 
enced the kind of response elicited. 

It is difficult to draw conclusions 
from the aforementioned “classical” 
projection studies due to the varying 
methods of measuring the concept. 
Murstein’s work pointed out the in- 
fluence of background factors for the 
manifestation of projection, as well 
as the occurrence of projection with 
both “‘friendly’”” and “‘hostile’’ per- 
sons. It would appear, therefore, that 
the mechanism of projection is best 
understood as a means of attaining 
self-consistency rather than solely as 
a defense mechanism. Within this 
framework, the denial and _ subse- 
quent projection of favorable traits 
may be readily understood. The 
“tough guy’’ may vigorously repress 
any inclination of “humanity’’ to- 
wards others since such behavior 
would be inconsistent with his self- 
perceived role. 


Attributive Projection 


By far the greatest amount of re- 
search has been undertaken within 
the confines of this concept. Sears 
(1937) found a correlation of .64 be- 
tween an inventory measuring self- 
criticism and one tapping ideas of ref- 
erence. Apparently, self-critical per- 
sons perceived others as also being 
critical of them. Thomsen (1941) 
found that in the 1940 presidential 
election, the majority of persons 
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sampled were of the opinion that the 
candidate whom they favored would 
win the election. Wallen (1941) 
sampled 85% (N=237) of the stu- 
dents in a small residential college 
asking them to estimate the percent- 
age of students in the college who 
held certain opinions on each of three 
current issues (war entry, draft, St. 
Lawrence seaway), and in addition 
to state their own views. A signifi- 
cant proportion overestimated in the 
direction of their own opinions. 

Goldings (1954) had 20 Harvard 
students rate pictures of college 
students on a 10-point scale of happi- 
ness, and then rate themselves on a 6- 
point scale. Some relationship was 
found between the self-rating and 
ratings of the pictures. A tendency 
was found, however, for individuals 
who avowed extreme happiness, to 
attribute less happiness to others. 
This effect may have been a function 
of regression to the mean, low relia- 
bility, and the ‘‘ceiling effect’’ of 
“happy persons.” It will be readily 
apparent that no “extremely happy”’ 
person (rated 10 on a 10-point scale) 
could perceive anyone as “happier” 
than himself; only ‘equally happy” 
or “less happy’ persons might be 
perceived, making for a negative cor- 
relation between self and others for 
such persons. 

Mintz (1956) found that children 
who had just seen the film ‘Peter 
Pan”’ tended to perceive Peter Pan’s 
age as close to their own. The cor- 
relation for boys was .34, and for 
girls .68, both significant. 

Halpern (1955) had 38 student 
nurses take the GAMIN personality 
inventory, and had them predict the 
responses of other nurses in their im- 
mediate subgroup. Predictions for 
referents having self scores similar to 
their own were significantly more ac- 
curate than were predictions for dis- 
similar persons. The r for similarity 
vs. predictive accuracy was .84. Sim- 
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ilar findings are reported by Suchman 
(1956), and Wittich (1956). Alfert 
(1958) found that when item re- 
sponses were socially desirable, ‘“‘at- 
tributive”’ projection (Alfert used the 
term ‘‘assumed similarity’’) increased 
for traits congruent with the perceiv- 
er’s ideal more than items involving 
ideal-discrepant traits. Though the 
persons whose responses were pre- 
dicted were comparative strangers to 
the predictor, they were probably of 
similar education and socioeconomic 
status. Thus, Alfert’s findings that 
more items involving traits congru- 
ent with the S’s ideal self were at- 
tributed to these “strangers’’ than 
were said to be possessed by the S 
himself seems understandable. One 
may wonder whether such favorable 
perceptions of others would have oc- 
curred among persons of another so- 
cial or economic group. 

Fiedler (1951, 1952, 1953) found 
that persons liked were assumed to be 
more similar to the self than persons 
disliked. Good therapists, for ex- 
ample, showed a greater tendency to 
assume similarity unjustifiably be- 
tween patients and themselves than 
was the case for poor therapists. 
These studies seem to indicate the in- 
dependence of the concept of “at- 
tributive’’ projection from defensive 
behavior. 

The above findings clearly dem- 
onstrate that people often predict 
that other persons, usually of a sim- 
ilar occupational or social group, hold 
many views similar to their own. 
Moreover, these predictions are often 
better than chance. This finding, 
however, may be subsidiary to the 
fact that many of the experiments in- 
volved situations where there was 
little ego-threat. In addition, most of 
the studies involved rather homo- 
geneous groups where, due to the sim- 
ilar personality patterns of the mem- 
bers, the predictions of accurate pre- 
dictors would undoubtedly show cor- 
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relations with their own self re- 
sponses, thus automatically making 
them ‘‘projectors.” 

Lundy (1956) has shown that when 
the center of attention is on the self, 
projection is more likely to occur 
than when it is on the other person. 
When, however, an _ ego-involving 
situation is at stake, it might be ex- 
pected that individuals would be 
more careful in making distinctions 
between themselves and _ others. 
Thus, Rokeach (1945), studying the 
ego-involving attribute of “beauty” 
among a female population at Brook- 
lyn College, found a nonsignificant 
correlation of —.08 between self- 
ratings and ratings ascribed to other 
females. 

The role of self-involvement as dis- 
tinct from insight is sharply illus- 
trated in a study by Weingarten 
(1949). She had a group of 74 college 
students write autobiographies of 
themselves. Two judges used the 
autobiographies to rate the subjects 
on “tension” and ‘‘insight’’ with re- 
gard to self, family, and the non- 
familial social environment. The Ss 
also were given a series of 75 state- 
ments describing behavioral incidents 
and asked to interpret the psycho- 
logical import of these incidents. Pro- 
jection was measured by the correla- 
tion between the judges’ ratings of 
the subject’s tension as expressed in 
his autobiography, and his inter- 
pretation of the behavioral incident 
inventory. The results appear in 
Table 2. From these correlations it is 
apparent that ‘‘attributive’’ projec- 
tion is not necessarily related to in- 
sight. Where the “‘self’” was involved, 
both high- and low-insightful groups 
tended to project. Again, it appears 
helpful to view this result as signify- 
ing a “‘self’’-enhancing tendency for 


4c 


most persons regardless of their in- 
sight into their own personalities. 
By perceiving behavior in the be- 
havioral incident inventory as sim- 
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TABLE 2 
““PROJECTION’’ CORRELATIONS FOR HIGH- AND 
Low-INnstGHt GROUPS FOR SELF, FAM- 
ILY, AND SOCIAL ENVIRONMENT 
(WEINGARTEN, 1949) 




















| High- | Low- 
Variable | Insight | Insight 
| (N=24) | (N=33) 
r | or 
Self 41 | (38 
| 
Family 00 | 40 
Social Environment By | 4 








ilar to their own, as described in their 
autobiographies, the Ss tended to 
justify their own behavior as similar 
to the norm. 

A welcome relief from the omni- 
present college population is afforded 
in a study by Friedman (1955). He 
used three groups, 16 normals, 16 
psychoneurotics, and 16 paranoid 
schizophrenics, all of whom Q-sorted 
items pertaining to their ‘‘phenom- 
enological” self and their ‘‘ideal’’ 
self. In addition, they were presented 
with five TAT cards and their result- 
ing themes were rated as to the pres- 
ence of “‘projected”’ self. 

The findings revealed significant 
relationships between ‘“‘phenomeno- 
logical’”’ self sorts, and “projected” 
self on the TAT, for the normal and 
the neurotic groups, but not for the 
psychotic one. Considering the 
“ideal” self sort vs. ‘‘projected”’ self 
on the TAT, only the correlation of 
the normal group proved to be 
significant. The correlations revealed 
no significant differences between 
normal and neurotic population be- 
havior concerning the ‘‘phenomeno- 
logical” self’ and TAT protocols. 
Both groups showed a tendency to 
reveal something about themselves 
on the TAT, probably similar to that 
given on the Q sort. While the nor- 
mal group projected their ‘‘ideals’’ 

















THE CONCEPT OF PROJECTION 


on the TAT cards, however, the 
neurotic group did not. Once again, 
the manifestation of projection 
seemed to be a function of self con- 
cept. The more anxious neurotic 
group was less consistent in its mani- 
festation of ‘‘self.”’ 

Some writers have attempted to 
improve the measurement of projec- 
tion by more quantitative methodol- 
ogies than the earlier writers used. 
Rokeach (1945), for example, in his 
study of beauty, designated as in- 
sightful those subjects who in the 
beauty group slightly overestimated 
themselves, and in the homely group 
slightly underestimated themselves. 
His reasoning was that regression to 
the mean made the group judgments 
a slight underestimation of the true 
score of the individual. But, regres- 
sion to the mean is a function of re- 
liability; the greater the reliability, 
the less the regression toward the 
mean. Since no reliability coefficients 
were obtained with regard to the 
group judgments, Rokeach’s method 
seems questionable. 

Bender and Hastorf, in a series of 
experiments, sought an exact quan- 
titative approach to projection (1950, 
1952, 1953). A typical experiment 
(1952) required 50 students to fill out 
an adjustment scale for themselves 
and for the way they thought their 
friends would rate themselves. The 
variables were defined as follows: 

Projection: The total item-by-item 
deviation of the forecaster’s own re- 
sponses from his predictions for an 
associate. Empathy: The deviation 
between the forecaster’s prediction 
for his associate and his associate's 
self-rating. Refined Empathy: Em- 
pathy minus Projection. Similarity: 
The deviation of a forecaster’s self- 
rating from his associate’s self-rating. 
The correlations obtained, are shown 
in Table 3. 

Similarly, Cowden (1955) in a 
study of married couples, also found 
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§ on = 
rABLE 3 
CORRELATION OF “PROJECTION” WITH “‘SIM- 
ILARITY,” “‘EMPATHY,’’ AND “REFINED 
Empatuy” (BENDER AND Hastorr, 1952) 








Variables | r 
spiipeeeediadiens teatime — 
Projection vs. Similarity .32 
Projection vs. Empathy .37 
som .58 


Projection vs. Refined Empathy 


a substantial correlation between 
Projection and Empathy. 

The results of both studies seem to 
imply that the more similar the per- 
sonalities of two individuals, the 
more likely is projection apt to occur 
when one individual attempts to 
evaluate the personality of the other. 
The ability to see an individual as he 
sees himself (empathy) is related to 
the tendency toward projection. If, 
however, the projection component 
is removed from the score of Em- 
pathy, then a negative relationship 
exists between these two variables. 

The conclusions, however, are an 
artifact of the statistical procedure. 
A contributing factor to the spurious 
relationship between projection and 
empathy as defined above is the fact 
that each of these scores had an 
identical component (prediction for 
an associate). The other scores illus- 
trate similar common components 
when they are correlated with each 
other. An objection might also be 
made on purely logical grounds to the 
assumption that projection has oc- 
curred if a person attributes a trait to 
another which he possesses himself 
(Murstein, 1957b). 

In a more recent article, Hastorf, 
Bender, and Weintraub (1955) 
brought out another difficulty with 
their earlier work. People who have a 
tendency to give end scale responses 
(almost always, almost never), and 
who also accurately predict midscale 
personalities, assure themselves of 
high Refined Empathy scores. The 
reason for this is that Refined Em- 
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pathy is equal to Raw Empathy 
minus Projection. Hence, the more 
dissimilar the personalities, the small- 
er the projection component sub- 
tracted, and the larger the Refined 
Empathy score. 

By way of confirmation, the au- 
thors reported a rho coefficient of .47 
between the tendency to give end 
scale scores and the Refined Em- 
pathy score. Conversely, a man who 
used the midpoint of the scale most 
frequently in his predictions for 
others, and also used midpoint pre- 
diction for himself, automatically 
became a projector. In short, projec; 
tion under these circumstances was a 
function of response habit rather 
than any aspect of personality or 
cognition. 

The independence of response hab- 
it and predictive efficiency is in- 
dicated in a study by Leventhal 
(1957). On the basis of the number 
of descriptions used to describe their 
friends, he derived groups of per- 
ceptually ‘simple’ and ‘‘complex’”’ 
judges. ‘‘Simple’’ judges were more 
apt to assume similarity between 
themselves and others in their pre- 
dictions than were ‘‘complex’”’ judges, 
who tended to differentiate between 
themselves and other persons. 
Nevertheless, both groups were about 
equal in predictive accuracy. In a 
study by Crow (1957b), medical stu- 
dents subjected to training in inter- 
personal relationships manifested 
greater complexity in their predic- 
tions of patients’ MMPI profiles (i.e., 
the variance of their predictions in- 
creased as compared to their pretrain- 
ing predictive variance), than they 
had prior to this training. Neverthe- 
less, their accuracy dropped as a re- 
sult of this training. Apparently, by 
overstressing the “individual,” the 
basic similarity of persons or at least 
of their answering habits may be un- 
derestimated. 

In a study involving a plethora of 
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discrepancy scores, Fabian (1954) de- 
fined projection as the rating of 
“others’’ by a subject, minus the 
mean rating of the subject by the 
group. Insight was measured by the 
self-rating minus the mean rating of 
S by the group. Again, because of 
the component common to both 
measures (mean rating of S by the 
group), a (spurious) positive correla- 
tion was reported for projection and 
insight. 

Bieri and his co-workers (1953, 
1955a, 1955b) have been interested in 
the relationship of the interaction of 
persons and changes in perception. 
In an early study, Bieri (1953) found 
that after interaction for some 20 
minutes, college students’ ‘‘assimila- 
tive’ projection scores (predicting 
the same responses for another as one 
fills out for oneself) tended to increase 
significantly. The test used to meas- 
ure these predictions was the Rosen- 
zweig P-F scale. Not surprising was 
the fact that, although a chance score 
would have been 8, the mean score 
was approximately 13.5 even before 
any interaction. In other words, the 
students could predict one another's 
responses with considerable accuracy 
even before they interacted! This 
matter will be discussed in greater 
detail later. 

In another study, Bieri (1955a) in- 
vestigated the relationship between 
the behavioral traits used in describ- 
ing persons (cognitive complexity), 
and the operational constructs of 
predictive accuracy and ‘‘assimilative"’ 
projection. Predictive accuracy was 
composed of “accurate” projection 
(wherein ‘‘A’”’ correctly predicts a re- 
sponse for ‘“‘B’’ which is the same as 
“‘A’s” self prediction), and accurately 
perceived differences between fore- 
caster and forecastee. ‘‘A ssimilative’’ 
projection was composed of “accu- 
rate’ projection and “inaccurate” 
projection. 

The results in Table 4 seem to 
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TABLE 4 
CORRELATIONS BETWEEN COGNITIVE CoM- 
PLEXITY AND VARIOUS PERCEPTUAL 
Scores (Brier, 1955) 





| ne ae 
Predictive Behavior | Cognitive 








Complexity 
r 
Predictive accuracy .29 
“Assimilative” projection — .32* 
“Accurate” projection .02 
Accurate perceived differences eg 
“Inaccurate” projection — .40* 
Actual similarity | .20 





* Significant at .05 level. 


show that lack of cognitive com- 
plexity in the perception of others 
leads to the perception of others as 
similar in response habit to the fore- 
caster, often with a resulting loss in 
accuracy of perception. The study is 
somewhat weakened by the fact that 
several of the correlations are pre- 
determined, and _  nonindependent 
from the other correlations. Thus, 
given that the correlation involving 
accurate perceived differences is posi- 
tive, it is most probable that the cor- 
relation containing “‘imaccurate’’ pro- 
jection is negative. The reason for 
this may be readily seen. Consider- 
ing all of the responses where the 
forecaster and forecastee differ in 
their self-responses, the following 
truism emerges: if the forecaster ac- 
curately perceives these differences, 
then he does not inaccurately per- 
ceive the forecaster as similar to him- 
self. In other words, he does not per- 
ceive inaccurately. Hence, anyone 
with a high accurate perceived dif- 
ferences score must consequently 
have a low “‘inaccurate”’ projection 
score. If both variables are them- 
selves correlated with the same vari- 
able (cognitive complexity of per- 
ception), then a positive correlation 
for accurate perceived differences (.35) 
makes it probable that there will bea 
negative correlation for ‘‘inaccurate”’ 
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projection (—.40). In the foregoing 
table, therefore, few correlations may 
be selected without determining the 
others to some degree. 

In another study, Bieri, Blachar- 
sky, and Reid (1955) used the Incom- 
plete Sentences Blank (ISB), and the 
Manifest Anxiety Scale (MAS), as 
indicators of adjustment. They pre- 
dicted a negative relationship be- 
tween degree of maladjustment and 
accuracy of predicted behavior, and 
a positive relationship between the 
degree of maladjustment and the 
tendency towards “‘assimilative’’ pro- 
jection. The Ss were 33 college men 
and 7 college women. The results 
were opposite to those predicted. 
The better adjusted individuals used 
“assimilative’’ projection more than 
the more poorly adjusted, while the 
poorly adjusted were significantly 
more accurate in perceiving differ- 
ences between themselves and others. 
The better adjusted tended to per- 
ceive others like themselves (r =.71) 
and they were largely accurate in this 
perception (r=.64). The more mal- 
adjusted persons tended to perceive 
others as different from themselves 
with high accuracy. Thus, one 1s 


faced with two different subgroups hav- 


ing good predictive ability with regard 
to other persons, one group being called 
‘‘projectors,"’ and the other not. What 
more concise proof of the inadequacy 
of the definition of projection? The 
measure of projection is again a func- 
tion of the homogeneity of the group. 
In a fairly homogeneous college 
group there is a high degree of simi- 
larity between responses to various 
questionnaires, and accordingly a 
high degree of “projection” and 
“accuracy” for those able to discern 
group homogeneity. Deviates in 
terms of maladjustment are readily 
aware of the differences between 
themselves and the group, and can 
accordingly predict others’ responses 
accurately. Such persons, however, 
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rarely ‘‘project,’’ because they are 
quite different from the majority of 
the group. “Projection,” in this 
sense, is an artifact, since for both ad- 
justed and maladjusted persons the 
same degree of predictive efficiency 
receives different labels with different 


connotations. 
The evidence from experiments 
utilizing ‘‘attributive’’ projection, 


therefore, would seem to support the 
following conclusions: 

(a) The use of discrepancy scores 
has resulted in many psychological 
findings which are statistical arti- 
facts. 

(b) ‘‘Attributive’’ projection may 
be related to  ego-involvement. 
Whether projection occurs or not 
would seem to be dependent upon the 
relation of such behavior to the self 
concept. 

(c) “‘Attributive’’ projection may 
result from such diverse phenomena 
as correctly perceiving similarity be- 
tween another and oneself, or incor- 
rectly perceiving such similarity. It 
may stem either from a lack of in- 
formation, often referred to as 
“naive” projection, or from an ade- 
quate supply of information (ac- 
curately perceived similarity). In 
short, the term connotes such a 
varied number of meanings that it 
possesses little explanatory signif- 
icance without reference to the com- 
position of the group of perceivers, 
their homogeneity, and ego-involve- 
ment in the task. Also to be con- 
sidered is the variability of the ‘‘per- 
sonalities’’ to be judged as well as the 
nature of the information available 
about them. The measurement of 
“attributive’ projection is more 
likely, therefore, to reveal the cogni- 
tive-response habits of a judge than 
information about the dynamics of 
his personality. 


“‘Autistic’’ Projection 


One of the earliest recognized dis- 
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tortions in perception has been that 
which stems from the manifest needs 
of the subject. In 1904, Kiilpe found 
that “actual sensory qualities of the 
stimulus which were not relevant to 
the task-set were to all intents and 
purposes not seen”’ (Helson, 1953, p. 
22). Helson also gives the example of 
the classical ‘‘complication"’ experi- 
ment where the S watches a moving 
pointer and reports its location at the 
moment a bell is sounded. The 
stimulus for which the S is ‘‘set’’ is 
perceived prior to the incidental one. 
“Thus if the beli sounds at 20 objec- 
tively, it is seen at scale division 10 
when the pointer is attended to; if the 
sound is attended to, the pointer is 
seen at 30. This phenomenon is per- 
ceptual and is not a matter of judg- 
ment’’ (Helson, 1953, p. 22). 

Murray (1933) in some early work 
preceding the development of the 
TAT, gave some photographs to a 
group of eleven-year-old girls at a 
house party which his daughter gave. 
The girls rated the pictures once after 
a normal pleasurable experience, and 
once after a game of ‘‘murder.”’ 
There was a considerable increase in 
the degree of maliciousness attributed 
to the pictures after the second game. 
That even nonambiguous tasks may 
elicit projection of emotional states 
has been reported by Johnson (1937a, 
1937b) who found that normal but 
euphoric persons tended to overes- 
timate distances between two points, 
while normal but depressed subjects 
tended to underestimate these same 
distances. 

Sanford’s two studies (1936, 1937) 
on the effects of abstinence from food 
on imaginal processes were the pre- 
cursors to the ‘‘New Look”’ in per- 
ception. He gave 10 children words 
to associate and ambiguous pictures 
to interpret, both before and after 
meals. The subjects gave signif- 
icantly more food responses before a 
a meal than after one. McClelland 
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and Atkinson (1948) have also found 
increases in food responses as a func- 
tion of periods of deprivation not 
exceeding 16 hours. 

Levine, Chein, and Murphy (1942), 
using food deprivation periods of 3, 
6, and 9 hours, found that a deprived 
group gave more food responses after 
3 and 6 hours than at the start in re- 
sponse to chromatic ambiguous fig- 
ures projected on a screen. After the 
9th hour, however, they manifested a 
decrease. Similarly Brozek, Guetz- 
kow, and Baldwin (1951) working 
with a group of men subjected to 
semistarvation for 24 weeks meas- 
ured the perception of food responses 
through the use of direct questioning, 
by the Rosenzweig Picture-Frustra- 
tion Test, the Kent-Rosanoff Free- 
Word Association Test, and the 
Rorschach. The number of food re- 
sponses showed no significant change 
as a function of time. Apparently the 
projection of deprived need is not a 
simple phenomenon. Projection oc- 
curs only when the S believes there is 
some chance of immediate gratifica- 
tion. When it becomes apparent 
that gratification will be delayed 
there is no projection. Projection is 
once again seen to be subsidiary to 
the need-gratification expectation of 
the perceiver. 


Rationalized”’ Projection 


The data with regard to “rational- 
ized”’ projection have supported this 
meaning of the concept in the few 
experiments reported. In a study at 
the University of Vienna, Frenkel- 
Brunswik (1939) had four judges 
rate the conduct of some 40 students 
as well as write a description of their 
personalities. The students wrote an 
autobiography concerning their con- 
duct at the University, the principles 
guidingg their conduct, and_ the 
changes they felt should be made. 
She found that many students felt 
that the environment should com- 
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pensate for their personal shortcom- 
ings. “In this instance the subject 
seems not to realize his own defect, 
but rather to project it on the en- 
vironment” (1939, p. 418). For ex- 
ample, the rho correlation between 
the rating of lack of scientific ability 
by the judges and comments by the 
students of a need for pedagogical 
changes was .60. A lack of discipline 
on the part of the students was ra- 
tionalized by their demand for more 
regimentation (rho=.62). Moreover, 
the overambitious persons often 
checked ‘I always do what I am 
ordered to do,” and the aggressive 
ones asserted “‘I do not let myself be 
intimidated.”’ 

In a unique and imaginative ex- 
periment, Posner (1940) gave a 
group of eight-year-old children two 
toys to play with, one preferred, and 
the other nonpreferred. They were 
then asked to give one to a friend to 
play with, after which each child was 
asked which toy he thought the 
friend would have given away. A 
selfish judgment (friend also would 
have given away the nonpreferred 
toy) was considered to indicate 
projection. The control situation, 
created to avoid guilt feelings, and in- 
volving another matched group, did 
not require the child to give away 
one of the toys. Under these circum- 
stances there was a much smaller de- 
gree of projection of selfishness. 

Bellak (1944) found that the ag- 
gressive word content increased in 
TAT stories in the last five cards, 
when subjects were severely criti- 
cized, after receiving the first five 
cards without comment. Although 
Bellak interpreted this as ‘‘true” 
(Freudian) projection, he reported 
that when criticized for the poor 
quality of their stories, the subjects 
admitted their inabilityZbut offered 
excuses. They blamed the “ambigu- 
ity’ of the pictures and the “‘inade- 
quacy”’ of the instructions for their 
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poor performance. Apparently, the 
subjects were producing ‘‘rationa- 
lized”’ projection rather than “‘clas- 
sical’’ projection. Their stories mani- 
fested considerable hostility of which 
they were aware, but which they 
could not express because of the 
authoritiative position of the ex- 
aminer (Murstein, 1957b). 


Miscellaneous Studies of Projection 


Several experiments could not be 
categorized either because of a lack 
of information concerning the con- 
cept of projection or because the aim 
of the experiment was to arrive at 
some refinement of categories (Cat- 
tell & Wenig, 1952). Holt (1951) 
asked 10 judges to rate each of 10 
college students on 36 personality 
variables, including ‘‘projection”’ on 
a 6-point scale. ‘‘Self-insight’’ was 


the only concept not rated as such, 
but was obtained as the sum of the 
squared differences between the rat- 


ings of the judges and the subjects. 
The correlation between ‘‘projec- 
tion” and “self-insight’? was .50, 
which, because of the paucity of 
subjects, did not quite reach signif- 
icance at the .05 level. The correla- 
tion of “projection” with the six most 
attractive traits was .54; with the 
six most unfavorable traits .04. ‘‘In- 
telligence’’ was found to correlate .48 
with “projection.”’ These rather un- 
usual findings are difficult to inter- 
pret because one does not know what 
conception of projection each judge 
held. Another difficulty is the lack of 
information as to the relative number 
of favorable traits compared to un- 
favorable ones. If one hypothesizes 
that the judges were using an attrib- 
utive concept of projection without 
reference to the self concept of the S, 
then the results are somewhat more 
understandable. In a fairly homo- 
geneous select group (Harvard col- 
legians), the smarter, perhaps better 
adjusted students, would tend to re- 
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fer to others as similar to themselves 
(probably quite correctly), and to be 
perceived by the judges as “‘projec- 
tors,”’ since they would have attrib- 
uted characteristics to others that 
they themselves possessed. Once 
again, in this eventuality, one would 
deal with persons classified as ‘‘pro- 
jectors” solely through the small 
amount of group variance resulting 
from group homogeneity. 

The Blacky Test was used by 
Cohen (1956) on a college group in 
which Ss were rated on the dimen- 
tions of projection, regression, reac- 
tion-formation, and avoidance. The 
Ss worked together on a task and 
evaluated each other on interpersonal 
ability. Cohen found that ‘“pro- 
jectors’’ in a group showed more 
negativeness and hostility in inter- 
personal relationships towards each 
other than when in other groups. 
When paired with ‘‘nonprojectors,” 
however, ‘‘projectors’’ were not more 
threat-oriented than other dissimilar 
pairs. The hierarchy of defense re- 
sponses with regard to perceived 
negative interaction was (a) projec- 
tion, (b) regression, (c) reaction for- 
mation, and (d) avoidance. 

These interesting results indicate 
that projection, as measured by the 
Blacky Test, seemed to give an in- 
dication of noncommunication re- 
sulting from the perception of ex- 
treme threat to the self. It would 
have been helpful if the basis for de- 
ciding who was, and who was not, a 
projector had been discussed in 
greater detail. 

Cattell has concerned himself with 
the various concepts of projection 
(1944, 1951). An ambitious attempt 
to factor-analyze some of the different 
conceptions of projection was made 
by Cattell and Wenig (1952). The 
authors used the term “mispercep- 
tion’’ because they felt that they were 
dealing with measurement to a dis- 
crepancy between individual reac- 
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tions to the TAT-like pictures, which 
they used, and some superindividual 
standard of reaction. They believed 
that “‘misperception”’ is affected by 
three considerations: (a) abilities, 
(b) experiences (information, skills), 
and (c) the dynamic needs of the sub- 
ject. Their hypothesis was that the 
nature and magnitude of mispercep- 
tion effects generally are to be ac- 
counted for by eight misperception 
factors within the Ss. These were: 
(a) cognitive (intelligence), (b) cogni- 
tive (information concerning the 
field), (c) cognitive (information con- 
cerning the principal person), (d) 
consciously accessible dynamic traits 
(autism), (e) press-compatibility, and 
the defense mechanisms: (f) projec- 
tion, (g) rationalization, and (h) 
phantasy. 

A factor analysis of the stories se- 
lected (8 choices were possible, one 
for each hypothesized factor), and 
other ‘‘marker’’ variables did not 
seem to bear out the hypothesis. 
Various factors were teased out of 
the data and given labels of ‘“phan- 


tasy,” ‘“‘naiveté,” “autism,” ‘‘ra- 
tionalization,” and “true  projec- 
tion.”” Still, it is evident from the 


examination of loadings on each fac- 
tor that several kinds of ‘‘mispercep- 
tion’”’ are represented in each factor. 
On the “naiveté”’ factor, the ‘‘naiveté”’ 
measures range from .77 to .48. 
Nevertheless, a ‘“‘press’’ loading of 
approximately .40, as well as an 
“autism” loading of equal magnitude, 
was present. The “autism” factor 
contained substantial loadings for 
“rationalization” as well as_ for 
“autism.” The ‘true’ projection 
factor was loaded with ‘‘press’’ pro- 
jection, “phantasy,” and “‘naiveté.” 

Cattell and Wenig seemed mostly 
concerned with the high “press” 
loading on the “‘true’’ projection 
factor. They stated that a single 
underlying mechanism might be in- 
volved both in projecting an un- 
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conscious drive and in reconciling ex- 
ternal facts with conscious internal 
moods, namely, a need for self con- 
sistency. “If so, the underlying pro- 
cess would be better labeled as ‘self- 
saving’... . Secondly, and far more 
important, the misperception is not 
due to a single process, projection, 
but to several dynamic mispercep- 
tion processes, which superimposed, 
frequently act in different directions” 
(1952, p. 807). 

While Cattell and Wenig’s con- 
clusions were based on an extensive 
analysis of the data, there appears to 
be a serious difficulty with regard to 
their operational conception of what 
constituted the various kinds of*‘mis- 
perception” or “‘projection” as shown 
by these examples of what they re- 
garded as different kinds of projec- 
tion: the woman in the picture 1s 
dominating the man because he wants 
her to... he enjoys being dominated 
(autistic misperception); the man 
kneeling at the boy’s side is dominat- 
ing the boy because the boy is a very 
submissive person which necessitates 
that he be led (press-compatibility 
misperception) ; the older man is the 
boy’s father who dominates the boy for 
his own good (rationalization). 

One wonders whether any basic 
differences exist between these three 
themes. All of them seem to embody 
a “rationalized” type of projection. 
The difficulty in detecting clearly in- 
dependent kinds of projection may 
very well be traced to the noninde- 
pendence of the choices presented 
to the Ss. It would be of interest to 
repeat this study using valid opera- 
tions for distinguishing the different 
types of projection. 

Lastly, Jenkin (1956) had Ss look 
at pictures of varying ambiguity pro- 
jected onascreen. ‘‘Projection”’ was 
said to occur when the subject 
seemed sure of the objective reality 
of his interpretation. ‘‘Rationaliza- 


tion” occurred when the subject made 
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his interpretation in a_ tentative 
manner. Correlations of this version 
of projection with various Rorschach 
and Rosenzweig P-F determinants 
were reported. Projection was sig- 
nificantly correlated with the Ror- 
oo ge M%,. (.38), Rosenzweig’s 
E% (.36), and E-D% (.69). Nega- 
tive correlations were found for 
Rosenzweig’s M% (-—.35), and 
N-P% (-—.54). “Rationalization” 
was positively correlated with Rosen- 
zweig’s M% (.42), and negatively 
correlated with his E% (—.38) and 
the Rorschach’s W% (—.49), and 
M% (—.43). Obviously, “‘projection”’ 
here meant a confident assertive 
method of reporting perceptions while 
“rationalization” signified a Casper 
Milquetoast-like approach. 

The correlation between projec- 
tion and _ projection/rationalization 
was reported as .94, while that of 
rationalization with projection/ra- 
tionalization was found to be —.40. 
Both are artifacts containing spurious 
common elements. In the first case, 
the spurious element in the numera- 
tor (projection) assured a positive 
correlation, while in the latter case, 
the common element in the denom- 
inator (rationalization), provided an 
artifactual negative correlation. 


BERNARD I. 


The ‘‘Operational’”’ Dilemma 


The adherents of the psychoanalyt- 
ic school of personality measurement 
have been strongly attacked for their 
failure to define operationally such 


concepts as “projection” and “re- 
pression.”’ These criticisms, justified 
to a large extent, have led the more 
recent psychodynamically-oriented 
researchers to embrace more opera- 
tional definitions of such variables as 
“‘projection,”’ “empathy,” “reality,” 
and “‘insight.’’ Unfortunately, the re- 
sult of zealous operationism has been 
a neglect of the original psychological 
meaning of these variables. Instead, 
we are faced with a string of opera- 
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tional phrases of the ‘A’’ rates him- 
self (‘A’) as he thinks “B” would 
rate him—variety. When one has 
plodded through the adding and sub- 
tracting of the 8 operations necessary 
to arrive at a score of ‘“‘Refined Em- 
ce Ml or the 12 operations used in 
correlating ‘Refined Empathy” with 
“Projection’”” one may be sorely 
tempted to return to the ‘‘good old 
days” of literary definition. What is 
needed is an operational definition 
which does not depart from the ac- 
cepted psychological meaning of a con- 
cept. It is clear that much of the re- 
search reviewed in this paper has 
failed to fufill this need. Instead, 
judges have been asked to predict, 
according to Gage and Cronbach 
(1955), such diverse operations as: 


(a) how persons in general will behave; 

(b) how a particular category of persons de- 
viates from the behavior of persons in 
general; 

(c) how a particular group deviates from 
the typical behavior of the particular 

category it belongs to; 
how an individual deviates from ‘the 
typical behavior of the particular group 
he belongs to; 
how an individual on a particular occa- 
sion will deviate from his typical be- 
havior (Gage and Cronbach, 1955, p. 
413). 


One is faced here with many abili- 
ties whose relationship to personality 
patterns is rather slight. Cronbach 
(1955) mentions several of these abili- 
ties including ‘‘differential eleva- 
tion,’ measuring the forecaster’s 
ability to judge deviation of the in- 
dividual’s elevation from the average; 
“stereotype accuracy,’ measuring 
the judge’s accuracy in predicting the 
“generalized other’’; ‘differential ac- 
curacy,” reflecting the judge’s ability 
to predict differences between sub- 
jects on any item. The shrewd judge 
is often one whose predictive variance 
(ay) does not exceed the self predic- 
tive variance of the person being 
judged (¢,). ‘Accuracy is improved 





THE CONCEPT OF PROJECTION 


as dy approaches rxyox. That is to 
say, the variation in predictions 
should never exceed the variation in 
true responses and should ordinarily 
be much smaller’? (Cronbach, 1955, 
p. 181). 

Crow (1954) found predictions to 
be accurate for the generalized 
“other”’ person, when the group for 
whom predictions were made was 
(a) homogeneous and (b) judges ex- 
hibited response which were 
general over “‘others.’’ In another 
study, Crow and Hammond (1957) 
found that the response set of medical 
students (perceptual stereotypy of 
patients’ responses to a personality 
scale) was more consistent over time 
than differential accuracy (the abil- 
ity of S to predict differences be- 
tween other people on any item). 

Eight keys to interpersonal per- 
ception are listed by Gage, Leavitt 
and Stone (1956) as affecting ac- 
curacy. The keys are hypothetical 
protocols derived from various con- 
structs and compared with the fore- 
caster’s prediction for S and S’s self- 
response. The keys found to be im- 
portant in determining 
were: 


sets 


responses 


A Priori Keys: 


1. Acquiescent Tendencies on the 
Part of the Judge and S. If both are 
highly acquiescent, the former in pre- 
dicting, and the latter in answering 
personality inventories, accuracy of 
interpersonal preception will be high. 

2. Favorability of the Judges’ Pre- 
dictions and of the S's Self. If the 
items possess high social desirability 
and if the S wishes to appear in a 
favorable light, there will again be 
high accuracy of perception by the 
judges. . 

3. Adjustment Tendencies Derived 
from Adjustment Value of Responses. 
These are closely allied to the ‘‘favor- 
ability”’ tendencies. 
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Keys Obtained by Varying Instruction 
to Judges: 

4. Judge's Self-Description. Judges 
may be accurate because of high sim- 
ilarity to the subject and their as- 
suming similarity in their predic- 
tions, or they may be dissimilar, and 
assume little similarity and still be 
accurate. 

5. Stereotypy. By predicting the 
typical member, a stereotypy key is 
obtained. Accuracy ensues when the 
judge follows his stereotype when the 
subject is closely similar to it. 

6. A manifest stimulus value key 
may be derived from the modal 
judge’s descriptions of the subject. 
This key relates to the impression S 
makes on those judging him. When 
combined with S’s self perception, it 
yields an “insight” or “frankness” 
score. 


Keys Based on Central Tendencies of 
Predictions or Self Descriptions 

7. Modal Prediction keys may stem 
from (a) the average prediction of n 
judges for a single subject, and (0) 
the average prediction by a single 
judge for many subjects. The former 
key will result in high accuracy when 
judges make highly typical predic- 
tions for highly predictable others. 
High accuracy also will occur when a 
judge makes an atypical correct pre- 
diction for the way S will answer an 
item, while the majority of judges 
make a stereotyped but incorrect 
choice. The latter key (6) gives a 
measure of a judge’s “implicit stereo- 
type.” 

8. A modal self-description of the 
subjects may be utilized against a 
single subject’s self-responses and a 
single judge’s prediction. Accuracy 
then depends upon the judge’s stereo- 
type of the group as well as the in- 
dividual’s deviation from the stereo- 
type of the group. 

It is apparent that these various 
“keys” do not reflect very much 
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about defense mechanisms. In- 
stead, they seem to consist of cogni- 
tive response habits in which a 
fastidiousness for operationism has 
resulted in a growing aridness of 
conceptual meaning. One may well 
speculate as to the multitude of 
factors which would result from a 
factor analysis of the current opera- 
tional definitions of ‘“‘projection.”’ It 
is therefore hardly remarkable that 
projection has been found to corre- 
late both positively and negatively 
with such variables as ‘‘insight’’ and 
“empathy.” The dilemma can be 


resolved by insisting that operation- 
ism be utilized jointly with clinical 
meaning rather than 
planting the latter. 


merely sup- 


DISCUSSION 

From birth, man views the world 
imperfectly, using his sensitive but 
far from perfect eye as a camera to 
bring him into communication with 
the outer world. While it is true that 
staies correlated with cortical im- 
pulses are projected to the supposed 
location of the stimulation, such a 
concept of projection seems too 
broad to be of use. We propose to 
limit the definition of projection to 
perception or judgments having to 
do with the personality of the organ- 
ism, thereby eschewing any physio- 
logical or cognitive components. A 
definition which is broad enough to 
cover the material reviewed in this 
article is as follows: 

Projection: The manifestation of 
behavior by an individual which in- 
dicates some emotional value or need 
of the individual. 

Such behavior may vary in the degree 
of defensiveness, depending upon the 
situational context and the personal- 
ity of the perceiver. Thus, where 
little or no ego-threat is involved, a 
person may project his emotional 
values through his method of or- 
ganizing and selecting his personal 
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milieu and the objects inhabiting it. 
One might term such behavior as 
“life style’ projection. The fact that 
these behaviors are public usually in- 
dicates that they involve little threat 
to the “‘self.”’ ‘‘Autistic’’ projection 
illustrates the reaction to strong 
needs, but does not necessarily mean 
these needs are defensive ones. For 
one who eagerly awaits a visit from 
Aunt Agatha, who will be wearing a 
green coat, any middle-aged woman 
emerging from the train wearing a 
green coat may be momentarily mis- 
perceived as the expected relative. 
In “rationalized’’ projection, how- 
ever, we deal with data in which, 
despite the fact that the content is 
readily accessible to consciousness, 
the motivation 1s distorted so as to de- 
ceive with regard to the real intent 
of the act. The peak of distortion, 
“classical’’ projection, occurs when 
even the content must be denied be- 
cause of the extent of the threat,and 
the individual sees the unwanted 
behavior. as stemming wholly from 
outside sources. 

It will be noted that the concept of 
“attributive’’ projection has been 
omitted from this discussion. The 
reasons for so doing are that the 
operation of saying another would do 
as the S would do, need not involve 
any emotional values. It may stem 
instead from (a) a shrewd knowledge 
of the other person’s behavioral ten- 
dencies (perceiving correctly actual 
similarities between the ‘‘other’’ and 
oneself), (b) dull intellect (perceiving 
another as similar when he is not), 
(c) lack of information (‘‘naive”’ pro- 
jection), or (d) the differing needs of 
the person, which are either rela- 
tively nondefensive (‘‘autistic’”’ pro- 
jection), or strongly defensive (‘“‘ra- 
tionalized”’ or “‘classical’’ projection). 

In general, we have been critical of 
the “operational’’ school of re- 
searchers who have attempted to 
measure projection on the basis of a 
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forecaster’s response regarding him- 
self and others through the use of 
personality inventories and ques- 
tionnaires. The correlations accruing 
from these studies have usually been 
spurious in containing identical oper- 
ations in the variables correlated. 
Moreover, the number of correlations 
have often exceeded the available de- 
grees of freedom. 

In addition to being parsimonious, 
the operation measuring projection 
should be closely allied to the per- 
sonality construct being investigated 
so that one is not confronted with the 
phenomenon of coldly objective, un- 
involved persons mysteriously emerg- 
ing as “projectors.” Projection 
should not be a function of cognition, 
but of emotional involvement. 

In achieving the goals set forth, 
the measuring instruments should 
embody the following considerations: 

(a) The instrument should have a 
low internal consistency for any 
possible intermediary keys not re- 
lated to the investigation. 

(b) If the composition of the ex- 
perimental group is in the hands of 
the investigator, the group should be 
made as heterogeneous or representa- 
tive as possible (Crow, 1957a). This 
would also check against a lucky 
guess in which one person was gen- 
eralized for a similar population, thus 
giving a spuriously high predictive 
efficiency score. The representative- 
ness of a group might also prevent 
spuriously poor predictions when 
several individuals judge only one 
person who well may be an atypical 
member of his group. 

(c) It is extremely unlikely that a 
standard stimulus (ambiguous pic- 
ture) will elicit the same personality- 
meaningful reaction in every person. 
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For example, if as a result of short- 
term food deprivation a large ma- 
jority of persons see food or instru- 
mental activity leading to food, we 
do not have a manifestation of pro- 
jection. Rather, by the very fact that 
most persons react in the same man- 
ner, we may conclude that we deal 
with a biosocial reaction with little 
meaning for the understanding of the 
emotional life of a given individual. 

(d) The relationship between the 
information possessed by S and the 
kind of judgment asked of him should 
be investigated more thoroughly. 
Presumably the more “internal” the 
perception of others, the less relevant 
the information fed to the perceiver 
need be to induce inappropriate per- 
ception. 

(e) If projection is to be a pheno- 
menon involving emotional values 
with regard to the self, it is the task 
of the experimenter to indicate in 
what manner the judgments of the 
perceiver are indicative of these 


values rather than of relatively ex- 
traneous cognitive-response habits. 
In closing, one must allow much 
credit to the earlier workers involved 
with the concept of projection. Their 
bold sallies into unknown and tricky 
terrain have stimulated more exten- 


sive work in this area. Many subtle 
artifacts have been painstakingly re- 
moved from the material supposedly 
concerned with ‘‘projection,’’ ‘‘em- 
pathy,” “similarity,” “reality,”’ and 
“insight.”” One might guess that we 
are about ready for a new and more 
sophisticated set of personality con- 
structs. Our position is like that of 
the harried Christmas shopper who 
knows quite clearly what he does not 
want, but is not quite sure where to 
go from that point. 
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An astonishing number of advances 
have been made recently by pharma- 
cologists, biochemists, and neuro- 
physiologists into possible causative 
chemical factors of schizophrenia and 
other psychoses. As a result of the 
reported efficacy of the “tranquil- 
izers’’ and ‘‘psychic energizers”’ in the 
management of some psychotic pa- 
tients, many plausible fruitful hypoth- 
eses have been generated and very 
often verified. As these advances 
continue to be made, the conviction 
grows among workers in the area that 
the organic (chemical) changes re- 
cently discovered in schizophrenia 
are relevant to its etiology. 

The purpose of this review is to 
make available an introductory over- 
view of several of the outstanding 
theoretical formulations of the bio- 
chemistry of psychotic behavior for 
the experimental as well as the clini- 
cal psychologist. These theories 
should also prove useful in providing 
some rational basis for several chemo- 
therapies currently employed with 
psychiatric patients. However, prior 
to any such consideration it is of in- 
terest to review the results of some 
broad biochemical investigations 
which in the absence of specific hy- 
potheses nevertheless uncovered in- 
teresting correlates which await in- 
tegration into an as yet undefined 
unified biochemical theory of schizo- 
phrenia. 


Some Physiologic and Biochemical 
Correlates of Psychotic Behavior 
Any attempt to but briefly de- 


scribe all or even a significant num- 
ber of the studies that have shown 


correlation between physiological or 
biochemical factors and psychotic be- 
havior would necessitate a separate 
review or even a volume. Instead, a 
limited number of studies will be cited 
that are representative of the broad 
spectrum of research areas that have 
been pursued intensively. Some 
workers have focused their attention 
on the possibility that schizophrenia 
may be due to a cerebral toxin which 
is the by-product of a metabolic def- 
icit. Thus, it has been reported 
(Fischer, 1953) that the blood of 
schizophrenics was toxic to tadpoles 
while Federhoff and Hoffer (1956) re- 
ported that schizophrenic blood was 
toxic to fibroblasts nurtured in a tis- 
sue culture. However, this toxicity 
does not appear to be due to a toxic 
chemical specifically found in schizo- 
phrenic blood because the latter 
workers also found that the blood of 
patients undergoing surgery was 
equally toxic. This finding does sug- 
gest that some chemical, toxic to 
tadpoles and fibroblasts, is found in 
the bloodstream of people exposed to 
severe stress and is also found in the 
blood of schizophrenics. Reider 
(1957) employed Witt’s spider-web 
technique to determine the effect of 
schizophrenic urine on the complex 
perceptuomotor organization _ re- 
quired in web building and found 
significant abnormalities in the web 
pattern. Orstrém (1951) has focused 
his attention on phosphorus metab- 
olism and has reported a diminished 
turnover of adenosine triphosphate 
and a higher content of phosphogly- 
colic acid in the erythrocytes of 
schizophrenics. Still another after- 
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math of metabolic impairment may 
be reflected in lowered concentra- 
tions of carbonic anydrase in the 
occipital cortex of schizophrenics and 
in early cases in the frontal cortex as 
reported by Ashby (1950). Richter 
(1957) has stated that the different 
enzyme systems of the liver seem to 
be impaired in schizophrenics and 
that different enzyme systems may 
be affected in varying degrees. How- 
ever, he emphasizes that the enzy- 
matic malfunction need not be the 
invariable concomitant of  schizo- 
phrenia. 

In addition to these findings of dis- 
turbed enzymatic reactivity or meta- 
bolic defect, there are physiological 
studies that have employed measure- 
ments of body temperature, heart 
rate, basal metabolic rate, and thy- 
roid activity in normals and _ schizo- 
phrenics which demonstrate a greater 
variance of the measurements in 
schizophrenics than in healthy peo- 
ple. These results suggest that the 
schizophrenias represent a genera- 
lized disorder of steady-states. In 
summary, the few empirical findings 
reviewed up to this point demon- 
strate that schizophrenia may be 
characterized by some disordered 
metabolic response to stress, a defect 
in several important steps in phos- 
phorus metabolism, a disturbance 
in the balanced distribution of car- 
bonic anhydrase in the cerebral cor- 
tex, impaired enzymatic activity of 
the liver and finally, a disordered 
homeostasis which is reflected in the 
measurement of various steady-states. 


““M-SUBSTANCE”’ 


An important biochemical theory 
of schizophrenia was formulated 
within this decade by Osmond and 
Smythies (1952), and elaborated by 
Hoffer, Osmond, and Smythies, 
(1954). The reader may be interested 
in the logical development of this 
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theory. It was Hoffer (1957) who 
claimed that schizophrenia is a dis- 
ease of the autonomic nervous sys- 
tem. Although vague as to the 
nature of the primary defect, whether 
it is a constitutional biochemical! fac- 
tor or psychogenic, the consequent 
events are each stated with war- 
rantable assertability. A dominant 
feature of the autonomic disturbance 
is an increased parasympathetic ac- 
tivity or pronounced central choliner- 
gic activity. This increased produc- 
tion of acetylcholine, by stimulating 
sympathetic ganglia, in turn pro- 
duces an increase in the secretion of 
norepinephrine and_ epinephrine. 
Whereas in normal individuals epin- 
ephrine is metabolized by amine 
oxidase or sulfoesterase, in schizo- 
phrenia defective metabolism of 
epinephrine, by a phenolase rather 
than the amine oxidase or sulfoes- 
terase, results in the production of 
quinone indoles (adrenochrome and 
adrenolutin) which interfere with 
cerebral metabolism. To prove this 
assertion that a cerebral toxin which 
is the by-product of sonie faulty 
metabolic process is the primary 
etiological factor in schizophrenia re- 
quires that a metabolite be isolated 
from schizophrenics which when in- 
troduced into healthy people should 
produce schizophrenia; or, the toxic 
metabolite should be found only in 
those people afflicted with schizo- 
phrenia; or, the toxin should be found 
in larger quantities; or, greater sensi- 
tivity to the metabolite by schizo- 
phrenics must be demonsirated. The 
next step in their analysis required 
the isolation of one metabolic fault 
out of the myriad metabolic processes 
that characterize life. They ap- 
proached this problem by emphasiz- 
ing the reported relationship be- 
tween certain classes of chemical 
compounds and disordered behavior. 
Specifically, the “hallucinogens” had 
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the property of producing in healthy 
people affects judged similar to the 


symptoms of schizophrenia. It 
should be emphasized that these 
theorists did not equate schizo- 


phrenia with the state produced by 
LSD (lysergic acid diethylamide) or 
mescaline. They viewed the hallu- 
cinogens as capable of engendering a 
behavioral complex which could serve 
as a “model psychosis.’’ The last 
step in the theoretical formulation re- 
quired the identification of some 
endogenous metabolite which had a 
chemical structure similar to that of 
mescaline and the theorists pointed 
to adrenaline. However, as adrena- 
line is an essential neurohumor, it 
was proposed that derivatives of 
adrenaline produced as a result of 
faulty metabolism may serve as the 
“‘M-Substance,”” the mescaline-like 
toxin of schizophrenia. Axelrod 
(1957) was soon to provide a clue as 
to the manner in which the substance 
could be formed when he found that 
methylation of the phenolic groups of 
adrenaline took place in vivo. Res- 
nick, Wolfe, Freeman, and Elmedilah 
(1958) reported that adrenaline is al- 
most entirely detoxified by the proc- 
ess of O-methylation in human be- 
ings. These chemical findings tended 
to support the speculations of the 
early theorists that faulty metabo- 
lism of adrenaline, possibly excessive 
methylation, could produce the toxin 
in schizophrenics. 

The quest for the exact description 
of ‘‘M-Substance”’ was given further 
impetus when it was claimed that 
“pink adrenaline’ could produce 
psychotic-like reactions in healthy 
people when injected. ‘Pink adrena- 
line,”’ it should be explained, is the 
resultant mixture of auto-oxidized 
products of adrenaline obtained when 
adrenaline is left exposed to light and 
air for some period of time. It was 
now suggested that one of the 
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oxidized products such as adreno- 
chrome or adrenolutine (trihydroxy 
N-methyl indole) or some other com- 
ponent of the oxidized mixture was 
the endogenous toxin, the ‘‘'M-Sub- 
stance.”’ In the presence of many 
stabilizing substances in the blood, 
however, it is difficult to conceive of 
auto-oxidation of epinephrine in vivo. 
Experiments have been performed in 
in effort to identify the active com- 
ponent of the oxidized mixture which 
is responsible for the production of 
disturbed behavior. Obviously, it is 
to be expected from the foregoing 
that the injection of some toxin of the 
oxidation products of adrenaline 
should produce aberrant behavior in 
healthy people, or, the toxin should 
be found in schizophrenics and not in 
healthy people, or a significant quan- 
titative difference should be found. 
Hoffer (1957) published observations 
on adrenolutine, an isomer of adreno- 
chrome which is more stable. He tried 
to determine which persons had been 
given adrenolutine and which the 
placebo with equivocable results. 
Gastaldi (1957) attempted to ascer- 
tain the effect of the more highly re- 
active, unstable adrenochrome and 
reported the absence of behavioral 
abnormalities. In a recent study 
(Holland, Cohen, Goldenberg, Sha, 
& Leifer, 1958) equilibrium plasma 
concentrations of adrenaline and 
noradrenaline were measured and no 
significant difference between healthy 
people and schizophrenics with re- 
gard to the rate of utilization of cir- 
culating adrenaline and noradrena- 
line was found. Axelrod (1958) at- 
tempted to detect the presence of 
adrenochrome in schizophrenics and 
found that it was absent in schizo- 
phrenics as well as healthy people. 
Hoffer (1957) deduced from his 
theory that chemical interference in 
the production of epinephrine from 
methylation of norepinephrine should 
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effectively reduce the amount of 
adrenochrome and thus lead to 
significant improvement in psychiat- 
ric patients. As nicotinic acid and 
nicotinamide are potent methyl 
group acceptors in vivo, he adminis- 
tered these compounds, and reported 
considerable success in the treatment 
of acute schizophrenics and little or 
no success with chronic schizo- 
phrenics. These results do not neces- 
sarily substantiate the theory. 

The disappointing results of these 
experiments have been rationalized 
by the theorists in at least two ways. 
First, it has been argued that so un- 
stable and biologically active a ma- 
terial as adrenochrome if introduced 
into the blood stream may not sur- 
vive unchanged by the time it 
reaches the brain. Second, the point 
is made concerning the relevance of 
the blood-levels of the various endog- 
enous metabolites to the activity of 
these toxins in the brain. For, if 
adrenaline and noradrenaline are 
neurohumors, then the substances re- 
quired for their synthesis and destruc- 
tion are also in the brain. Faulty 
metabolism of these catechol amines 
could produce toxins locally which 
would not be reflected in the blood 
because the toxins could not pass the 
blood-brain barrier. 

The evidence in support of the 
prominent role ascribed to adreno- 
chrome is based on some physiolog- 
ical observations on animals and in- 
direct clinical evidence. Schwarz, 
Wakin, Bickford, and Lichtenheld 
(1956) reported that intraventricular 
injections of adrenochrome produced 
a drowsy state in monkeys with man- 
ifest heightened thresholds to painful 
stimuli. Walaszek, Smith, and Minz 
(1958) found that the serum of 
schizophrenics either abolished or re- 
versed the systemic vasopressor re- 
sponse produced by topical applica- 
tion of adrenaline to the exposed 
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cerebral cortex of the rabbit, whereas 
the serum of healthy people had no 
such effect. Indirect clinical evi- 
dence to support the role of adreno- 
chrome in schizophrenia was pro- 
vided by Lea (1955). He inferred 
that since adrenochrome is antihis- 
taminic in action, if schizophrenics 
have abnormally excessive quantities 
in their circulation, then they should 
be more resistant to allergic condi- 
tions than healthy people. In his 
study of military schizophrenics, us- 
ing head-injured soldiers as controls, 
he found a highly significant defi- 
ciency of allergic reactions among the 
schizophrenics. 

In summary, the theory which pos- 
tulates the existence of an ‘‘M-Sub- 
stance,’’ an endogenous metabolite of 
adrenaline produced by a deficiency 
in the process of methylation, as yet 
lacks the empirical verification for 
the criteria requisite for confirmation 
of the theory as established by the 
early theorists. Evidence is stiil 
forthcoming that the production of 
schizophrenic symptoms can be elic- 
ited in healthy people by a metab- 
olite that is isolable from schizo- 
phrenics, or, that the metabolite is 
found solely in schizophrenics, or at 
least in significantly larger amounts 
in them as compared to healthy 
people. 


ANTIMETABOLITES OF SEROTONIN 


While the previous biochemical 
theory of schizophrenia, it would 
seem, was established by an observa- 
tion of the similarity in chemical 
structure between an _ hallucinogen, 
mescaline, and the neurohumors, 
adrenaline and noradrenaline, the 
second theory here to be considered 
was formulated after the observation 
was made that the most powerful 
hallucinogen, LSD, was a potent pe- 
ripheral antagonist of serotonin (5- 
hydroxytryptamine). The role of 
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serotonin as a chemical mediator of 
neural activity in the brain has been 
described by Brodie and Shore (1957). 
The clue as to its important role came 
from its discovery in the brain. Like 
acetylcholine, the substance is pres- 
ent in nervous tissue in a precursor 
state and-is active only in an un- 
bound form. The amine is unevenly 
distributed in the brain, its concen- 
tration being highest in the brain- 
stem, especially the hypothalamus, 
lowest in the cortex, and almost un- 
detectable in the cerebellum. The 
high biological activity of this amine 
strongly suggests an important func- 
tion in chemical mediation in sub- 
cortical centers. Additional support 
for a role of serotonin in neural 
transmission comes from considera- 
tion of the distribution of the enzyme 
monoamine oxidase, the enzyme that 
destroys serotonin, and 5-hydroxy- 
tryptophan decarboxylase, the en- 
zyme that synthesizes serotonin, 
both of which are found in highest 
concentration in the hypothalamus. 

Woolley and Shaw (1954) postu- 
lated that the hallucinogenic effect of 
LSD might be due to its interference 
with the action of serotonin cen- 
trally, in the brain. An excess of 
serotonin or a deficiency may lead to 
transmission dysfunction and con- 
comitant behavioral disorder. Ac- 
cording to Woolley and Shaw, the 
action of LSD, as well as several other 
alkaloids, can be ascribed to the in- 
dole moiety in the structures of LSD 
and serotonin. Consequently, com- 
pounds containing the indole ring 
may act as antimetabolites to sero- 
tonin. LSD is enough like serotonin 
in structure to be taken up by the 
serotonin receptors in the brain in 
lieu of serotonin. This suggests that 
mental aberrations are produced by 
an inadequate amount of serotonin 
at its site of action. However, these 
same workers (Shaw & Woolley, 
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1956) also showed in a later paper 
that LSD could act like serotonin in 
potentiating its effect on the blood 
pressure of anesthetized dogs. This 
then suggested the possibility that 
there may be an excessive concentra- 
tion of serotonin in other forms of 
mental disease. 

Several studies can be cited which 
tend to support the theory that inter- 
ference in the activity of serotonin in 
the brain (possibly by an endogenous 
antimetabolite containing the indole 
nucleus) is related to psychotic be- 
havior. In one study (Zeller, 1958) it 
was found that after the administra- 
tion of large quantities of tryptophan 
to schizophrenics and healthy people, 
the schizophrenics excreted signifi- 
cantly less 5-hydroxyindolacetic acid 
than the normals. This acid is a 
break-down product of serotonin 
which in turn is a derivative of 
tryptophan. 

The serotonin hypothesis is further 
favored by the fact that reserpine, a 
tranquilizer which contains an indole 
ring, depletes the brain of its store 
of serotonin and its tranquilizing 
effect parallels the time course of this 
depletion and not the time course of 
the presence of reserpine in the brain 
(Shore, Pletscher, Tomich, Carlsson, 
Kuntzman, & Brodie, 1957). It was 
deduced, therefore, that reserpine 
and its active analogues bring about 
tranquilization through interference 
with cellular binding of serotonin. 
Within the framework of the seroto- 
nin hypothesis, Brodie and Shore 
have proposed the following hypo- 
thetical scheme to account for the 
tranquilizing effect of reserpine and 
chlorpromazine and the _ hallucino- 
genic effects of LSD and mescaline. 
They propose that the hypothalamic 
parasympathetic centers are acti- 
vated by serotonin-liberating, or 


“serotonergic” nerves, and the sym- 
pathetic centers by norepinephrine- 
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liberating, or adrenergic nerves. 
Dominance of the parasympathetic 
centers causes sedation or tranquili- 
zation, and that of the sympathetic 
centers, wakefulness. Reserpine, by 
preventing serotonin binding, will 
thus cause constant activation of the 
parasympathetic centers, while chlor- 
promazine, the other potent tran- 
quilizer, would bring about sedation 
by blockade of adrenergic impulses 
to the central sympathetic centers 
and the consequent dominance of 
parasympathetic activity. 

This paradigm permits Brodie and 
Shore to postulate the following 
mechanism of action of the two most 
potent hallucinogenic compounds. 
LSD blocks stimulation of the hypo- 
thalamic parasympathetic centers by 
interfering with the release of seroto- 
nin from the “serotonergic nerves,” 
thus unmasking the action of the 
opposing sympathetic system. Mes- 
caline, on the other hand, stimulates 
the “‘adrenergic”’ brain centers of the 
posterior hypothalamus directly, and 
mimics the action of norepinephrine. 
Thus an apparent parallelism be- 
tween psychotic-like behavior and 
increased sympathetic activity pro- 
duced by the hallucinogens has been 
proposed by Brodie and Shore, which 
tends in part to support Hoffer’s 
emphasis upon the role of epinephrine 
and norepinephrine in the etiology of 
schizophrenia. A stricter interpre- 
tation of the Brodie-Shore hypotheses 
would: restrict the effect of the 
tranquilizers and the hallucinogenic 
compounds to levels of activity, i.e., 
sedation, wakefulness, without any 
reference to psychotic behavior. For, 
at no time do the authors attempt to 
systematically relate the increased 
wakefulness resulting from excessive 
stimulation of adrenergic activity to 
psychotic or psychotic-like states. 
The only relationship that may be 
logically deduced from their writing 
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is that psychotics are wide-awake. 

Although the foregoing hypotheses 
regarding the role of serotonin in 
mental function seem attractive, 
there are some cogent criticisms 
which adherents of the serotonin 
paradigm have yet to answer. A close 
analogue of LSD, 2-bromo-D-lysergic 
acid diethylamide (Brom LSD) was 
found by Cerletti and Rothlin (1955) 
to be as effective as LSD in antago- 
nizing several peripheral actions of 
serotonin, yet this antagonist of sero- 
tonin was found to be devoid of hal- 
lucinogenic action in humans. It 
should be noted that chlorpromazine 
is also a potent antiserotonin and 
instead of being hailucinogenic is a 
powerful antihallucinogen. 

Before proceeding to a brief de- 
scription of still another recent bio- 
chemical theory, it would serve well 
to re-emphasize how important a 
role the hallucinogens, specifically 
LSD and mescaline, have played in 
the two previous biochemical formu- 
lations of schizophrenia. The pro- 
ponents of each theory have bor- 
rowed freely from each _ others’ 
research efforts to substantiate hy- 
potheses or propose biochemical 
nuances. This cross-fertilization im- 
plies that we have two distinct 
biochemical approaches which ulti- 
mately will coalesce into a unified 
biochemical theory of schizophrenia. 
This hope for the future is somewhat 
mitigated by a consideration of the 
clinical effects of LSD and mescaline 
which appear to be significantly dif- 
ferent. Thus Matefi (1952) employ- 
ing himself as a subject to study the 
effects of LSD and mescaline reported 
that they produced different psycho- 
pathologic reactions; the former, one 
of hebephrenic type and the latter 
catatonic. Drawings produced under 
the influence of LSD showed a tend- 
ency to expansion, while the ‘‘mes- 
caline pictures’ showed constriction. 
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Fischer, Georgi, and Weber (1951) 
also reported that LSD produced a 
predominantly hebephrenic state, 
whereas during mescaline intoxica- 
tion catatonic features were out- 
standing. From such experiments in 
which subjects receive both drugs, it 
is a little difficult to see how pre- 
sumably the same biochemical im- 
pairment can produce different ex- 
periences within the subject which 
are also significantly differentiable by 
an observer. These clinical observa- 
tions lead to the obvious suggestion 
that the psychotic-like experience 
elicited by the hallucinogens is not 
homogeneous. Furthermore, as the 
behavioral concomitants of these 
toxic states are clearly differentiable, 
we may be dealing with two distinct 
biochemical processes, two distinct 
detoxification processes that are set 
off by various classes of chemical 
compounds. In short, considering the 
lack of identity between schizophrenia 
and the toxic states produced by 
sundry hallucinogens and the possi- 
bility of heterogeneity in mental dys- 
function as specifically related to the 
class of compound administered to 
healthy people, the author may not 
have been considering biochemical 
theories of schizophrenia but rather 
biochemical theories of chemical psy- 
choses. Future studies will un- 
doubtedly clarify some of these issues. 


INTERFERENCE IN ACETYLCHOLINE 
METABOLISM 


In addition to Hoffer, Pfeiffer and 
Jenney (1957) have emphasized the 
role of brain acetylcholine as a pos- 
sible etiological factor in schizo- 
phrenia. In contrast to Hoffer who 
believes that schizophrenia is charac- 
terized in part by excessive central 
parasympathetic (cholinergic) ac- 
tivity, Pfeiffer and Jenney have pre- 
sented evidence which supports the 
view that schizoph.enics are deficient 
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in acetylcholine. They base their 
argument on the facts that: (a2) many 
tranquilizers have persistent acetyl- 
choline-like effects; (6) drugs which 
show the nicotinic properties of 
acetylcholine, e.g., di-isopropyl flu- 
orophosphate, make schizophrenics 
worse; and (c) drugs which show 
muscarinic effects, e.g., arecoline, 
make them better. In their study in- 
volving 23 schizophrenics they ad- 
ministered arecoline, a parasympa- 
thetic stimulant that passes the 
blood-brain barrier freely and leads 
to pronounced cholinergic effects, 
and methyl atropine nitrate, a 
quaternary analogue of atropine that 
protects against the peripheral effects 
of parasympathetic stimulants. Re- 
markable changes, it is claimed, en- 
sued within 1-2 min. and lasted for 
15 min. During a lucid interval which 
persisted for 15 min., the patients 
became more talkative, showed 
greater insight and sociability. 
Recently, Rubin (1958a; 1958b) 
has reported the results of an experi- 
ment that may help to resolve the 
manifest contradiction between the 
theoretical formulations of Hoffer and 
Pfeiffer. It was shown that the 
hydrolysis rate of acetylcholine by 
human erythrocyte cholinesterase 
was significantly different between 
healthy people and ps,chotics. Fur- 
thermore, it was demonstrated that 
the psychotics were distributed be- 
tween two discrete groups, each of 
which differed significantly from the 
normals with respect to the kinetics 
of enzymatic activity. One group 
hydrolyzed - the substrate more 
rapidly than the normals while the 
other group hydrolyzed it more 
slowly. As rapid hydrolysis reduces 
the concentration of free acetyl- 
choline, the former group may be 
assumed to be deficient in acetyl- 
choline, and therefore corresponds to 
the defect postulated by Pfeiffer. 
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Defective enzymatic hydrolysis of 
acetylcholine in the latter group of 
patients may be characterized by 
excessive cholinergic activity, and 
they would deviate from Pfeiffer’s 
model while congruent with Hoffer’s 
hypothesis. 


SUMMARY 


A brief review of several outstand- 
ing empirical studies suggests that 
schizophrenia is characterized by 
some disordered metabolic response 
to stress which in turn is dependent 
upon a neurohumoral or enzymatic 
defect. The three outstanding the- 
ories described have been derived 
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from considerations of the chemical 
and pharmacological properties of the 
hallucinogenic or psychotomimetic 
agents. The fruitfulness of these 
theories and their biochemical con- 
structs seems to be dependent upon 
the validity of the basic underlying 
equivalence relationship presumed to 
exist between the model psychosis 
produced by psychotomimetic agents 
and the pathological state(s) char- 
acterized as schizophrenia. The in- 
tense activity of researchers in this 
area should soon provide the rele- 
vant information required for evalu- 
ation of the basic assumption and 
possibly an effective chemotherapy 
for the schizophrenias. 
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Statistical interaction has _ been 
most commonly defined as a measure 
of the joint effect of p variables upon 
performance. If a scaled independent 
variable is involved, the interaction 
sum of squares may be further ana- 
lyzed, yielding more information 
than the above definition would indi- 
cate. This analysis was first pre- 
sented by Alexander (1946), and 
subsequently extended by Grant 
(1956). This paper will present a 
complete analysis of the (a—1)(b—1) 
degrees of freedom (df) involved in 
the interaction of two independent, 
scaled variables, A and B. The pur- 
pose is twofold: to extend the present 
state of comprehension of the inter- 
action term and to introduce a meth- 
od which will permit inferences about 
the rate of change of slope and curva- 
ture coefficients. 

Grant has shown that if B is a 
scaled independent variable, with 
equal steps (or equal logarithmic 
steps), the (b—1)(a—1) df can be an- 
alyzed into b—1 components, each 
ona—1df. For example, if Ss in each 
of 3 groups (a=3) are tested on 5 
consecutive days (b=5), one could 
compare the slopes of the curves for 
the three groups (linear component), 
the quadratic components of the 
three curves, the cubic components, 
and lastly the quartic components. 
Each of the four (b—1) mean squares 
would be on two (a—1) df. The ap- 
propriate error terms for these com- 
parisons have also been presented by 
Grant. 

If A and B are both scaled vari- 
ables there are a number of alterna- 
tive ways of analysing the data. We 
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could, as above, compare the A 
curves over the levels of B. Alterna- 
tively, the B curves might be com- 
pared over the values of A. A more 
efficient analytical approach exists 
which will provide more information 
than can be obtained from perform- 
ance of both the previously suggested 
analyses, and which will also shed 
light on the relation between the two 
ways of graphing the data. 

The proposed analysis stems from 
one basic fact: if both variables are 
scaled, a sum of squares on 1 df can 
be computed for each of (a—1)(b—1) 
components of the interaction sum of 
squares. Table 1 presents a set of 
data which has been analyzed in this 
manner. The design is a 5X3 fac- 
torial, with four entries in each of the 
15 cells. The analysis may be ap- 
proached in either of two ways. The 
B curves may be compared with re- 
spect to each of a—1 components; 
each sum of squares is then further 
analyzed into b—1 components. This 
has been done in the left half of Table 
2. Alternatively, the A curves may 
be compared with respect to each of 
b—1 components, followed by fur- 
ther analysis of each sum of squares 
into a—1 components. This ap- 
proach is presented in the right half 
of Table 2. The actual choice of ap- 
proaches is of no consequence, since 
the same information can be ob- 
tained from either half of Table 2. 

Consider the calculations for the 
left half of Table 2. The interaction 
sum of squares, computed in the 
usual manner, is 151.067. Further 
analysis requires reference to tabled 
values of orthogonal polynomials 
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TABLE 1 


DaTA AND COMPUTATIONAL AIDS FOR THE ANALYSIS 
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Totals : 3: 2 —119 103 
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(Fisher & Yates, 1953). 
values are: 


For 4 df the 


Zi= 2 —-! 0 +1 +2 
Z:2+2 -1 -—-2 -1 +42 
Z3=-—-1 +2 Oo —2 +41 
Za=+1 -—-4 +6 -4 +1 
For 2 df the values are: 
X,=-1 0 +41 
Xe=+1 -—-2 +1 


The subscript ‘‘1"’ refers to linear, 
“*2"" to quadratic, ‘3’’ to cubic, ‘'4” 
to quartic. The sum of squares for 
the comparison of the linear compo- 
nent of the B curves may now be cal- 
culated. 


b n a 
SS B-1inear = ; ( Zz, > X 
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In our example 
SS p—tinear (1) 
= [(—2)(—10)+(—1)(—15) 
+(0)(—17)+(1)(—39) 
+ (2)(— 38) ]*/(4)(10) (2) 
= 80.000 


The SSpg_tinearc2) Would be calculated 
by using value of Zs, rather than Z,, 
in Equation [2]. Thus 


SSp—tinear (2) 
= [(2)(—10)+(—1)(—15) 
+(—2)(—17)+(—11)(— 39) 
+ (2)(—30) }?/(4)(14)(2) =0.571 


-(LEL vw) Je Dx [1] 


where 

1 refers to scores within cells, 
=1,2, » nt. 

sins ede el ®, j=1, 2, , 
b. 

k refers to levelsof A,k=1,2,---, 
a. In our example »=4, a=3, and 


b=5. The calculations are 

SS p—tinear = [(- 10)?+(- 15)?+(- 17)? 
+(—39)*+(—38)*]/(4)(2) 
—(—119)2/(5)(4)(2) =93.350 

This quantity is on 4 df and may 

now be analyzed into 4 components 

each on a single df. We have called 

these components B-linear (1), B- 


linear (2), etc. The calculations fol- 
low. 


Values of Xo, rather than X,, 
are utilized in the calculations of 
SSp—quadratie and its components. 


SS p—quadratic 
= [(—2)2+(7)?+(23)*+(45)? 
-+ (30)?]/(4)(6) — (103)2/(5) (4) (6) 
= 57.717 
SS B—quaivetic: (1) 
= [(—2)(—1)+(—1)(7) +0) (23) 
+-(1)(45) + (2) (30) ]2/(4) (6) (10) 
= 43.350 


By substituting values of Zs, Zs, 
and Ze for Zi, SSp—eusdretict)s 
SSp- quadratic(3), and SB—guadvatiet®) can 
be computed. 





b n a 2 
SS p—tinear (1) = ( ) a DiZsXiVun) n> 22>) Xi [2] 
‘i i s 
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TABLE 2 





Sum of 
Squares 


Source df 





Two APPROACHES TO THE ANALYSIS OF THE INTERACTION 





Sum of 


Source df Squares 





AXB 151.067 


AXB 151.067 





B-linear 


80.000 


B-linear (1) 


A-linear 


123.350 





A-linear (1) 80.000 





B-linear (2) 0.571 





B-linear (3) 





B-linear (4) 


B-quadratic 
B-quadratic (1) 


B-quadratic (2) 





B-quadratic (3) 


B-quadratic (4) 


The mean squares on single d/’s are 
measures of changes in slope and 
curvature of the B curves, over the 
levels of B. As an illustration, as- 
sume that we are interested in deter- 
mining the function which describes 
the relation of dark adaptation rate 
(i.e. the slope of the dark adaptation 
curve) to preadapting light intensity. 
If we let B stand for intensity and A 
for trials, the SSp—tinearay Would en- 
able us to test the hypothesis that 
rate of dark adaptation shows a 
linear decrement with increases in in- 
tensity. The SSp-tinear(2) would en- 
able us to test the hypothesis that 
the function relating rate of dark 
adaptation to intensity hasa quadrat- 
ic component. The SSg-~guadratien) 
term is a measure of the extent to 


A-linear (2) 43.350 





A-quadratic 


5.821 





A-quadratic (1) 


A-quadratic (2) 


0.571 





5.250 


13.067 





A-cubic 





A-cubic (1) 


5.000 





8.067 


8.829 


A-cubic (2) 


A-quartic 2 


A-quartic (1) 








A-quartic (2) 





which the plot of the quadratic co- 
efficient of the dark adaptation curve 
shows a linear change as a function 
of intensity. The Grant analysis 
yields a test of the hypothesis that 
the slopes (or quadratic, cubic, etc. 
components) of a number of curves 
do not differ. If they do differ, the 
analysis under discussion yields in- 
ferences about the way in which the 
component changes from curve to 
curve. 

The analysis of the data in Table 1 
was carried out by first computing 
the B-linear and B-quadratic compo- 
nents, then analyzing each of these 
into four components. We might 
have, as easily and meaningfully, 
computed four A components, then 
analyzed each of these into two com- 
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ponents. The results of these anal- 
yses are shown in Table 2. Note 
certain relationships between the two 
computational approaches. For ex- 
ample, B-linear (2) is equal to A- 
quadratic (1), B-quadratic (4) is equal 
to A-quartic (2). The jth component 
of the kth component of A will always 
equal the kth component of the jth 
component of B. Thus tests of hy- 
potheses about the B-curves may be 
generated after an analysis of the A- 
curves by simply regrouping the 
single df components, and then add- 
ing. For example, B-linear is the 
sum of A-linear (1) and A-quadratic 
(1). 

A discussion of error terms still re- 
mains. There are three possible cases: 
I. Each S is measured once under 
only one combination of A and B. 
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II. There are different Ss at each lev- 
el of B, but all Ss are measured at all 
levels of A (or B may be the within- 
Ss variable, and A the between-Ss 
variable). III. All Ss are measured 
under all combinations of A and B. 

Table 3 presents the error terms 
(denominator of F) for all interaction 
components, in each case. 

Case I. The error term for all 
terms in Table 2 is the within-cells 
mean square. This value is 1.267. 
There are ab(n—1) df or, in this case, 
45. The within-cells sum of squares is, 
as always, the total sum of squares 
minus the between-cells sum _ of 
squares. 

Case II. Assume that each row in 
Table 1 represents a different S. The 
AB interaction is tested against 
a Ss A/B (subjects-by-A-within-B) 


TABLE 3 
Error TERMS FOR THREE DESIGNS 











Case II 




















Case I Case III 
Numer- Denomi- ; 
ator nator Numerator of F Denominator of F es anal Denominator of F 
of F of F 
Allterms Within AB AB SAB (.538) 
in Ta- cells — 
ble 2 (1.267) A-linear B-linear error—B.-linear (.738) 
A-quadratic }Ss XA/B (.550) B-quadratic error—B-quadratic (.340) 
A-cubic A-linear error—A-linear (.197) 
A-quartic A-quadratic error—A-quadratic (.903) 





B-linear and all 
its components 


A-linear (1) 
A-quadratic (1) 
A-cubic (1) 


A-quartic (1) 





error-linear (.742) 


A-cubic error—A-cubic (.456) 





A-quartic error—A-quartic (.600) 





B-linear f1}) 
A-linear (1){ error—B-linear (1) (.333) 


B-linear (2) \ 
A-quadratic (1)/error—B-linear (2) (1.000) 








B-linear oO} 
A-cubic (1) error—B-linear (3) (.833) 





B-quadratic and all 
its components 


A-linear (2) 


A-quadratic (2) 
A-cubic (2) 





A-quartic (2) 


error-quadratic (.358) 


B-linear (4) } 
A-quartic (1)/ error—B-linear (4) (.783) 





B-quadratic (1) 


A-linear (2) ferror—B-quadratic (1) (.061) 





B-quadratic (2) 
A-quadratic (2) Jerror—B-quadratic (2) (.806) 





A-cubic (2) error—B-quadratic (3) (.078) 





B-quadratic (4) 


} 
B-quadratic sa | 
A-quartic (4) } 


error—B-quadratic (4) (.417) 











INTERACTION OF TWO SCALED VARIABLES 


-error term on b(m—1)(a—1) df (30 
df). The computational formula is: 


SSssxa/B = SS Total —SSss—SSa—SSap 
= 16.500 [3] 


The mean square of .550 is the error 
term for A-linear (2 df), A-quadratic, 
A-cubic, and A-quartic, as well as for 
the AB interaction. 

The linear component of 
(error-linear) is computed from 


error 


RY rror—linear 


-EX(LMru) / Coxe 


= [|(—4)?+(—3)?= -- - +(—10)?+ 
—[(—10)2(—15)?+ - - 
= 11.125 


The corresponding mean 
(df = b(n—1) =15) is .742. 
error term for B-linear and all its 
components, and for A-/linear (1), 
quadratic (1), A-cubic (1), and A- 
quartic (1). 

The quadratic component of error 
(error-quadratic) is given by Equa- 
tion [4] with values of X2 substituted 
for X;. The sum of squares is 5.375; 
the mean square is .358 on 15 df, and 
is the error term for B-quadratic and 


square 
This is the 


1. —B—linear 


rx(z 


—[(—33)?+(- 
= 8.850 


389 


all its components, and for A-linear 
(2), A-quadratic (2), A-cubtic (2), and 
A-quartic (2). 

Case III. Assume that the first 
row at each level of B (Table 1) rep- 
resents the performance of a single S, 
that the second rows represent a sec- 
ond S, etc. Thus we have four Ss go- 
ing through 15 combinations of A 
and B. Table 4 presents various 
sums of crossproducts which should 
facilitate calculations of the error 





Y (Lo Kw) /aDxe 


eye 
- +(—38)?]/(4)(2) 


[41 





terms for the single df components. 
The error term for AB is SAB, and 
is computed in the usual manner. 
The resulting sum of squares is 
12.933, which, when divided by 
24 (df =(n—1)(a—1)(b—1)) yields a 
mean square of .538. The SAB sum 
of squares may next be analyzed 
into a@—1 components, each on 
(n—1)(6—1) df. In our example we 
have an error-B-linear and an error- 
B-quadratic, each on 12 df. 


} - Vin) /=ux _ x(Lox Vin) n>. X} 
_ o(x > mite) / 1 X+ (> x > > xVen) / nD x; 


K—4)4+-(—3)*4+-(—2)°+(—1)* 
—[(—10)2?+(—15)?+ --- eee (4)(2) 


- +(—10)?+(—8)*]/2 


31)?-+ (—27)?+(—28)?]/(5)(2)-+(—119)?/(5)(4)(2) 


[s] 
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INTERACTION OF TWO SCALED VARIABLES 


The mean square is .738 and is the 
error term for B-linear. The sum of 
squares for the error-B-quadratic is 
obtained by substituting X-2 values in 
Equation [5]. The resulting sum of 
squares is 4.083, and the mean square 
is .340, the latter being the error 
term for B-quadratic. The SAB 
term could also have been ana- 
lyzed into b—1 components, each on 
(n—1)(a—1) df. Equation [5] would 
then contain Z values rather than X 
values, and would yield the sums of 
squares for error-A-linear, error-A- 
quadratic, etc. These in turn would 
provide F tests for A-linear, A- 
quadratic, etc. 

Next consider the error terms for 
B-linear and B-quadratic components. 
The sum of squares for error-B-linear 
(1) is given by 


n 


b a 
SS error ~B—linear (1) = > ( 2 > ZX, Y 
J k 
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example. This indicates that error- 
A-linear, error-A-quadratic, etc. can 
be obtained by adding appropriate 
error-B components. For example, 
error-B-linear (1) and error-B-quad- 
ratic (1) should equal error-A-linear. 

The analyses described permit in- 
ferences about such matters as the 
rate of learning as a function of 
amount of practice, the rate of dark 
adaptation as a function of pre- 
adaptation intensity, or the rate of 
extinction as a function of number of » 
conditioning trials. While such infor- 
mation may appear meaningful and 
useful, the reader may wonder if tests 
of quantities such as B-quadratic (4) 
are of any utility. To this, it may be 
answered that the complete single df 
analysis should extend our under- 
standing of the interaction term. 


w) j _ X; > 237 —SS p-tinear ql) 


=[(—19)2+ - ++ + (—23)?]/(2)(10) — 80.000 [o] 


= 1.00 


The mean square on »—1 (=3) df is 
.333 and is the error term for B-linear 
(1). The remaining error components 
(mean squares) are found in Table 3, 
aud were calculated in similar fashion, 
by substituting appropriate values of 
X and Z in Equation [6]. Relation- 
ships previously pajnted out with re- 
spect to the AB interaction hold for 
SAB. Thus, error-B-linear (2) is 
identical to error-A-quadratic (1), for 


Furthermore, calculation of addi- 
tional terms provides computational 
checks for those components which 
are of major interest. Finally, it is 
hoped that the information obtained 
from such analyses will form some 
basis for more quantitative, specific, 
and accurate predictions about _be- 
havior than our present theories gen- 
erally yield. 
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MULTIPLE 


COMPARISONS IN ANALYSIS OF VARIANCE 


JOHN GAITO ‘ 
Wilkes College 


Recently Ryan (1959) has provided 
a valuable service for psychologists 
by presenting newer techniques for 
making multiple comparisons after 
the analysis of variance F test has 
rejected the over-all null hypothesis 
that the means are equal. Some of 

~ these techniques previously were not 
available to psychologists because 
they were discussed in scattered 
sources. However, Ryan does not 
mention the fact that the analysis of 
variance technique does provide a 
means of making individual com- 
parisons (after rejection of the null 
hypothesis) which is suitable for 
many situations. This procedure 
involves partitioning the n degrees of 
freedom for the main effect into n 
orthogonal components, each with 
one df (Edwards, 1951; Senders, 
1958; Snedecor, 1946) and is suitable 
for either one variable or multiple 
variable designs. 

As an example of this procedure 
let us take the case in which we have 
three groups of Ss. Group 1 is a con- 
trol; Groups 2 and 3 are experimental 
groups. Inasmuch as three groups are 
involved, two df are present. There- 
fore, we can partition the sum of 
squares with two df into two single 
components, each with one df. The 
meaningful comparisons in this situa- 
tion would be G, vs. G2+Gs, and Ge 
vs. G3. 

As another example let us have two 
experimental groups (E; and Es) and 
a control group for each (C; and C2). 
Here we have a sum of squares for 
between groups with three df. We 
partition these three df into three 
single components, each with one df. 
The experimental hypotheses would 
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determine which comparisons would 
be relevant. In this case it appears 
that the investigator would be con- 
cerned with a comparison between 
(C,+E,) and (C2.+E2) for one df. A 
second df would involve the com- 
parison, C; vs. E,; the third df, Ce vs. 
E». Each of these three tests would 
be by the F ratio, the denominator 
being that which is provided by the 
analysis of variance model and would 
be the same for all three tests. This 
procedure allows one to make use of 
the power of the variance analysis 
technique (a_ well-explored robust 
procedure) rather than to develop 
new procedures. Thus the proba- 
bility statements would be exact and 
the error rate per comparison would 
be that of the probability level used 
for the tests of significance, as is 
usual with the analysis of variance 
procedure. This would maintain the 
error rate per experiment at a lower 
rate than would occur if all possible 
comparisons were effected. 

With the partitioning procedure it 
is important that the orthogonal com- 
parisons be planned before the experi- 
ment is conducted. In the second 
example, if the investigator had not 
planned the comparisons, he might 
be tempted to make different or- 
thogonal comparisons than those 
above after he looks at the data. For 
example, he might compare (E,+E2) 
and (C,+C.), E; vs. Ee, and C,; vs.Coe. 
However, which orthogonal «ompari- 
sons to be effected will be de- 
termined by the experimental hy- 
potheses. (For a simple means of de- 
termining orthogonality, see Senders, 
1958.) 

Using this procedure, Ryan’s Case 


re 
aie 











MULTIPLE COMPARISONS IN ANALYSIS OF VARIANCE 


1 (multiple comparisons) reduces 
essentially to his Case 3 (multiple 
variables in analysis of variance). A 
single variable might be involved but 
the partitioning into » components 
gives the analysis the appearance of a 
multiple variable design, according 
to procedure in making tests of sig- 
nificance. Thus, two or more F ratios 
will be involved and, in a strict sense, 
these ratios will not be independent 
because a common error estimate will 
be used. However, this lack of inde- 
pendence is irrelevant. The impor- 
tant point is that the numerator and 
denominator of the F ratio be inde- 
pendent. Each F ratio is based on 
the mathematical model (the F dis- 
tribution) and has all the power of 
that model. 

The above examples indicate the 
suitability of the partitioning pro- 
cedure for many cases. However, in 
some experiments the partitioning 
procedure will not be completely 


393 


efficacious inasmuch as all compari- 
sons which are required by the ex- 
perimental hypotheses may not be 
orthogonal. Yet if most comparisons 
are orthogonal ones it would appear 
that little loss of exactitude would 
occur with few nonorthogonal com- 
parisons. Sometimes the results of 
the orthogonal comparisons will pro- 
vide indirect information concerning 
the relative ranking of all different 
groups, thus obviating the need for 
making nonorthogonal comparisons. 
But in this situation the new multiple 
comparison techniques which Ryan 
discusses might be used. However, it 
should be pointed out that a trend 
analysis (Lindquist, 1953) might be 
more appropriate if the groups differ 
quantitatively (or qualitatively if the 
groups can be ordered). However, a 
regression analysis may be more 
meaningful if the groups differ in an 
orderly quantitative manner. 
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COMMENTS ON ORTHOGONAL COMPONENTS 


T. A. RYAN 
Cornell University 


Professor Gaito is to be thanked 
for pointing out a method for making 
multiple comparisons which was not 
discussed in my paper. Long as that 
paper was, it still could not cover 
all of the possible approaches. The 
method of orthogonal components 
which Gaito proposes has, however, 
several serious drawbacks. Moreover, 
the power of the method is little, if 
any, greater than that of other meth- 
ods which are jot subject to these 
difficulties. 

The following points must be kept 
in view in evaluating the method ad- 
vanced by Gaito: 

1. By restricting ourselves to or- 
thogonal comparisons, we are pre- 
vented from making comparisons 
which may be important in under- 
standing and interpreting the results. 
In his first example involving two ex- 
perimental groups and a single con- 
trol, Gaito asks two questions: (a) 
whether the two experimental groups 
differ jointly from the control group, 
and (b) whether the experimental 
groups differ from each other. If we 
ask these two questions we cannot 
then ask which of the experimental 
groups differ from the control. 

2. When the number of groups is 
larger than that in Gaito’s example, 
we are even more restricted in the 
questions which we can ask. For 
example, if there are 5 experimental 
groups and 1 control, we have a total 
of 5 degrees of freedom and therefore 
5 orthogonal comparisons are possi- 
ble. There are only 4 degrees of free- 
dom for question (6), so we can make 
only 4 of the 10 possible comparisons 
among the experimental groups. 
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Dunnett (1955) has provided a 
method for comparing each experi- 
mental group with a control while 
controlling the error rate experiment- 
wise. This method makes the same 
number of comparisons as the method 
of orthogonal components, but the 
comparisons it does allow are more 
meaningful for many common situa- 
tions. If we use Dunnett’s method 
we cannot find out which of the ex- 
perimental groups differ from each 
other. To ask these questions brings 
us to the situation where we make all 
possible comparisons among the 
means, and a method like Tukey’s is 
the most appropriate. 

3. The relative power of two meth- 
ods of comparison can be determined 
adequately only if the error rates are 
computed on the same basis. Dun- 
nett’s and Tukey’s methods are de- 
signed to control the error rate 
experimentwise. To compare these 
with the method of orthogonal com- 
parisons it is necessary to determine 
the experimentwise error rate for the 
latter. When this is done, as has been 
illustrated in Table 1, we find that 
the Dunnett method actually re- 
quires slightly smaller differences tor 
significance than does the method 
proposed by Gaito.' (We must bear 
in mind, of course, that the Dunnett 
method is not comparing the same 


1 In order to determine allowances for the 
method of orthogonal components in Table 1, 
the F (or ¢) value was determined at the 
tabled ‘'.025 level.”” Since there are two com- 
parisons in the example, the error rate per 
experiment is twice the nominal level of the 
single tests, or .05. The experimentwise rate is 
only slightly less (.04994). 








COMMENTS ON ORTHOGONAL COMPONENTS 


TABLE 1 


COMPARISON OF ALLOWANCES FOR 5% LEVEL EXPERIMENTWISE OR PER 
EXPERIMENT, Two EXPERIMENTAL GROUPS, AND ONE CONTROL 


F* for 
Orthogonal 
Components 


df for Error 





15 , 
30 : 


o 


0 
57 
2 


Critical ¢ for 
Critical ¢ for Tukey Method*** 
Dunnett -—— — 
Method** Adjacent 
Means 





Extremes 


46 < 2.62 
32 2.47 
21 2.35 





2.41 
2.29 
2.17 


* Read from F tables for ‘'.025 level’’ which gives .05 level per experiment, with two comparisons. 


** Allowance for comparing each of two experimental means w 


ith control mean, at .05 level experimentwise. 


*** For all comparisons at .05 level experimentwise, computed from Tukey's tables. 


pairs as the method of orthogonal 
comparisons, so we are comparing the 
power of one in making its compari- 
sons with the power of the other in 
making its own.) Tukey’s method, 
which permits us to make all possible 
comparisons of pairs of means, re- 
quires a somewhat greater gap when 
we are comparing the extreme means, 
but a smaller separation for adjacent 
pairs. 

4. When it has been decided in 
advance that only certain compari- 
sons are of interest, and that other 
comparisons will never be made un- 
der any circumstances, multiple com- 
parison procedures can be adapted to 
control the error rate per experiment. 
To do so, it is not necessary to ensure 
that all comparisons are orthogonal. 
The error rate is simply the product 
of the number of comparisons to be 
made and the nominal significance 
level of the individual tests (the error 
rate per comparison). This relation 
is not affected by the lack of inde- 
pendence of the comparisons (Ryan, 
1959, p. 39). 

Allowances based upon experiment- 
wise rates may be somewhat smaller 
than those for the corresponding rates 
per experiment. This is the reason for 
the difference between Dunnett’s al- 
lowances and those based upon F in 


Column 3 of Table 1. The allowances 
for the Tukey methed would have 
been slightly larger if we had com- 
puted them on the basis of rates per 
experiment. For example, the al- 
lowance for the extremes in the case 
of 15 degrees of freedom would have 
been 2.73 instead of 2.62, but this is 
the largest change which would have 
been made in the table. When we se- 
lect our comparisons, however, a 
method which controls the experi- 
mentwise rate is usually not readily 
available. What we may lose by the 
shift to the rate per experiment may 
nevertheless be compensated for by 
the reduced number of comparisons. 

5. In describing his procedure, 
Gaito has used the error rate per com- 
parison. This is also the rate per de- 
gree of freedom since the number of 
comparisons equals the degrees of 
freedom. In my paper, however, I 
argued that we should control the 
rate of error per experiment or ex- 
perimentwise. I shall not review 
these arguments here, since Gaito 
has not offered any reasons for pre- 
ferring the rate per degree of freedom. 

Even if we were convinced that 
the error rate should be based upon 
the number of degrees of freedom, 
there are other more satisfactory 
methods available. Duncan’s meth- 
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ods (1955) permit us to make all 
possible comparisons with an error 
rate which is proportional to the 
number of degrees of freedom. The 
Duncan test will require slightly 
greater separation of a pair of means 
for significance, in comparison to 
those in Gaito’s method, but this 
loss of power is more than compen- 
sated for by the possibility of asking 
more meaningful questions in our 
analysis of the data. 

Apart from this analysis of the 
method proposed by Gaito, I wish to 
correct a misunderstanding which 
may arise from his first sentence. The 
newer methods for multiple com- 
parisons do not require an initial F 
test of the over-all variation between 
groups. All of the methods men- 


tioned in the present note can be 
applied immediately, without such a 
preliminary test. 

Having raised a number of points 
of disagreement, I can nevertheless 
end with an important area of agree- 
ment, namely the possibility of using 
regression or trend analysis in cases 
where the groups can be ordered or 
classified according to a quantitative 
independent variable. This whole 
approach was not considered in my 
paper because of limitations of space 
and because it is generally much 
better understood than the cases of 
qualitative groupings. At least, in 
regression analysis, the number of 
significance tests does not increase 
geometrically as the number of groups 
or subdivisions goes up. 
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ERRATUM 


In ‘Multiple Comparisons in Psychological 
Research,” Psych. Bull., Vol. 56, No. 1, pp. 
26-47, there is an error in the next to last 
sentence of the appendix. This sentence 
should read: 

“Before M; can differ significantly from 
Ma, both Ms; must be found to be significantly 
different from M;, and M, must differ sig- 
nificantly from Mo.” 


In other words, a given pair of means can- 
not differ significantly in the layer method 
unless all larger subgroups to which it belongs 
are also significant. When a particular sub- 
group is found to have a nonsignificant range, 
no further tests are made within it, just as the 
total range must be significant for any further 
tests. (This point is correctly stated on page 
41.) 























